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TWO HUMAN NSP-LTKE PROTEINS 

The present invention relates to nucleic acid and amino acid 
5 sequences of two novel human NSP-like proteins and to the use of these 

sequences in the diagnosis, study, prevention and treatment of disease. 

BACKGROUND ART 

Neuroendocrine-specif ic proteins (NSP-A/ NSP-B, and NSP-C) are a 
recently characterized group of membrane-anchored endoplasmic reticulum . 

10 (ER) proteins that share identical carboxy-terminal amino acid sequences 

(van de Velde HJ et al (1994) J Cell Sci 107:2403-2416). Evidence 
suggests that NSP-A and NSP-C expression is restricted to neuronal and 
endocrine cell populations (van de Velde, supra) . Immunohistochemical 
studies showed that rat NSP-A is expressed throughout the rat brain (van 

15 ■ de Velde KJ et al (1994) Mol Brain Res 23:81-92). NSP-B, however, is 

found only in a small cell lung carcinoma ceil line and probably 
represents an aberrant NSP gene product (Roebroek AJ et al (1993) J Biol 
Chem 268:134 39-13447), A previously reported neuronally expressed rat 
gene, CI-13, and two partially sequenced human cDNAs (GI 39104 3 and GI 

20 894620), have a high degree of homology to NSPs which suggests that NSPs 

belong to a larger family of proteins (Wieczorek DF et al (1991) Mol 
Brain Res 10:33-41; Bell GI et al (1993) Hum Mol Genet 2:1793-798; 
Martin-Galla A et al (1992) Nat Genet 1:34-39). 

Two large hydrophobic regions characterize the NSPs and homologous 

25 proteins and suggest membrane association. In fact, immunofluorescence 

and biochemical studies have established an association between NSPs and 
membranes of the ER (Senden NH et al (1994) Eur J Cell Biol 65:341-353). 
Analysis of NSP-A deletion mutants indicates that the carboxy-terminal 
hydrophobic region is necessary for membrane binding (van de Velde et al, 

30 supra) . Carboxy-terminal amino acid sequences of the NSPs are highly 

homologous, although they are not a perfect match to a consensus motif 
sufficient for retention of transmembrane proteins in the ER (van de 
Velde, supra; Jackson MR et al (1993) J Ceil Biol 121:317-333). Thus, it 
appears likely that NSPs and related proteins are targeted to the ER by 

35 conserved carboxy-terminal amino acids. 

Immunostaining with* anti-NSP-A antibodies suggests that NSP-A may 
be associated with both the rough and smooth neuronal ER. On the basis 
of this evidence and knowledge of neuronal ER function, van de Velde et 
al (1994; supra) conclude that NSPs may be involved in the protein 

40 transport process or in the regulation of intracellular calcium levels in 

neuronal cells. 
l^SP-liK^ Prg^gjCng and D^gwgo 

Dysfunction of ER-mediated neuronal protein transport may 
contribute to neurodegenerative diseases. for example, in amyotrophic 
45 lateral sclerosis (ALS) , a degenerative disease of motor neurons, 
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position of neurofilaments in neuronal axons leads to dramatic defects in 
ER-mediated axonal transport of a variety of proteins (Coliard JF et al 
(1995) Nature 375:61-64). Defects in protein transport have been further 
implicated in the pathogenesis of ALS by a transgenic mouse study in 
which ALS is modeled by a mutation in superoxide dismutase (SOD). SOD 
mutant animals displayed clinical and pathological features of human ALS 
and showed axonal transport defects associated with dilation of the ER 
(Mourelatos Z et al (1996) Proc Natl Acad Sci 93:5472-5477). 

Analysis of specimens of a wide variety of primary human tumors 
show that NSP-A and NSP-C are expressed in small cell lung carcinoma, 
carcinoid tumors of the lung, but not in non-neuroendocrine non-small 
cell lung carcinomas (van de Velde et al (1994) Cancer Res 54:4769-4 776). 
Furthermore, antibodies generated to small-cell lung carcinoma surface 
antigens recognize NSP-A, NSP-B, and NSP-C. Therefore, NSPs may act as 
marJcers in human lung cancer diagnosis and prt>vide an avenue for 
corrective treatment (Senden NH et al (1994) Int J Cancer Suppl 8:84-88). 

New NSP-lilce proteins could satisfy a need in the art by providing 
new means of diagnosing and treating cancer and neurodegenerative 
disorders such as ALS. 

DISCLOSURE OF THE INVENTION 

The present invention discloses two novel human NSP-li)ce proteins 
(hereinafter referred to individually as NSPLPA and NSPLPB, and 
collectively as NSPLP) , characterized as having homology to human NSP-A 
(GI 307307), NSP-B (GI 307309), NSP-C (GP 307311), and rat CI-13 (GI 
281046) . Accordingly, the invention features two substantially purified 
NSP-lilce proteins, as shown in amino acid sequence of SEQ ID N0:1 and SEQ 
ID NO: 3, and having characteristics of NSPs. 

One aspect of the invention features isolated and substantially 
purified polynucleotides which encode NSPLP. In a particular aspect, the 
polynucleotide is the nucleotide sequence of SEQ ID N0:2 or SEQ ID NO:4. 
In addition, the invention features polynucleotide sequences that 
hybridize under stringent conditions to SEQ ID NO: 2 or SEQ ID NO: 4. 

The invention further relates to nucleic acid sequences encoding 
NSPLP, oligonucleotides, peptide nucleic acids (PNA), fragments, portions 
or antisense molecules thei^eof , and expression vectors and host cells 
comprising polynucleotides which encode NSPLP. The present invention 
also relates to antibodies which bind specifically to NSPLP, 
pharmaceutical compositions comprising substantially purified NSPLP, 
fragments thereof, or antagonists of NSPLP, in conjunction with a 
suitable pharmaceutical carrier, and methods for producing NSPLP, 
fragments thereof, or antagonists of NSPLP. 

BRIEF DESCRIPTION OF DRAWINGS 
Figures lA, IB and IC show the amino acid sequence (SEQ ID NO:l) 
and nucleic acid sequence (SEQ ID NO: 2) of the novel NSP-like protein. 
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NSPLPA. The alignment was produced using MacDNAsis software (Hitachi 
Software Engineering Co Ltd) . 

Figures 2A, 2B and 2C show the amino acid sequence (SEQ ID NO: 3) 
and nucleic acid sequence (SEQ ID NO: 4) of the novel NSP-like protein, 
5 NSPLPB (MacDNAsis software, Hitachi Software Engineering Co Ltd) . 

Figures 3A, 3B, 3C, 3D and 3E show the northern analysis for the 
consensus sequence (SEQ ID NO: 4). The northern analysis was produced 
electronically using LIFESEQ* database (Incyte Pharmaceuticals, Palo Alto 
CA) . 

10 Figures 4A, 4B and 4C show the northern analysis for Incyte Clones 

31870 (SEQ ID NO: 2) (LIFESEQ^ database, Incyte Pharmaceuticals, Palo Alto 
CA) . 

Figure 5 shows the assembly for the consensus sequence (SEQ ID 

N0:2) . 

15 Figures 6A, 6B, 6C, 6D, 6E and 6F sho^; the amino acid sequence 

alignments among NSPLPA (SEQ ID N0:1), NSPLPB (SEQ ID N0:3), NSP-A (GI 
307307; SEQ ID N0:5), NSP-B (GI 307309; SEQ ID N0:6), NSP-C (GI 307311); 
SEQ ID N0:7), and rat CI-13 (GI 281046 SEQ ID NO:8) produced using the 
multisequence alignment program of DNAStar software (DNAStar Inc, Madison 

20 WI). 

Figure l shows the hydrophobicity plot (generated using MacDNAsis 
software) for NSPLPA, SEQ ID N0:1; the X axis reflects amino acid 
position, and the negative Y axis, hydrophobicity (Figs. 7, 8, and 9) . 
Figure 8 shows the hydrophobicity plot for NSPLPB, SEQ ID NO: 3. 
25 Figure 9 shows the hydrophobicity plot for NSP-C, SEQ ID NO: 7. 

MODES FOR CARRYING OUT THE INVENTION 

Definitions 

''Nucleic acid sequence" as used herein refers to an 
oligonucleotide, nucleotide or polynucleotide,' and fragments or portions 

30 thereof, and to DNA or RNA of genomic or synthetic origin which may be 

single- or double-stranded, and represent the sense or antisense strand. 
Similarly, amino acid sequence as used herein refers to peptide or 
protein sequence. 

"Peptide nucleic acid" as used herein refers to a molecule which 

35 comprises an oligomer to which an amino acid residue, such as lysine, and 

an amino group have been added. These small molecules, also designated 
anti-gene agents, stop transcript elongation by binding to their 
complementary (template) strand of nucleic acid (Nielsen PE et al (1993) 
Anticancer Drug Des 8:53-63). 

40 As used herein, NSPLP refers to the amino acid sequences of 

substantially purified NSPLP obtained from any species, particularly 
mammalian, including bovine, ovine, porcine, murine, equine, and 
preferably human, from any source whether natural, synthetic, 
semi-synthetic or recombinant. 
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A "variant" of NSPLP is defined as an amino acid sequence that is 
altered by one or more amino acids. The variant may have "conservative" 
changes, wherein a substituted amino acid has similar structural or 
chemical properties, eg, replacement of leucine with isoleucine. More 
rarely, a variant may have "nonconservative" changes, eg, replacement of 
a glycine with a tryptophan. Similar minor variations may also include 
amino acid deletions or insertions, or both. Guidance in determining 
which and how many amino acid residues may be substituted, inserted or 
deleted without abolishing biological or immunological activity may be 
found using computer programs well known in the art, for example, DNAStar 
software. 

A "deletion" is defined as a change in either amino acid or 
nucleotide sequence in which one or more amino acid or nucleotide 
residues, respectively, are absent. 

An "insertion" or "addition" is that change in an amino acid or 
nucleotide sequence which has resulted in the addition of one or more 
amino acid or nucleotide residues, respectively, as compared to the 
naturally occurring NSPLP. 

A "substitution" results from the replacement of one or more amino 
acids or nucleotides by different amino acids or nucleotides, 
respectively. 

The term "biologically active" refers to a NSPLP having structural, 
regulatory or biochemical functions of a naturally occurring NSPLP. 
Likewise, "immunologically active" defines the capability of the natural, 
recombinant or synthetic NSPLP, or any oligopeptide thereof, to induce a 
specific immune response in appropriate animals or cells and to bind with 
specific antibodies. 

The term "derivative" as used herein refers to the chemical 
modification of a nucleic acid encoding NSPLP or the encoded NSPLP. 
Illustrative of such modifications would be replacement of hydrogen by an 
alkyl, acyl, or amino group. A nucleic acid derivative would encode a 
polypeptide which retains essential biological characteristics of natural 
NSPLP. 

As used herein, the term ^'substantially purified" refers to 
molecules, either nucleic or amino acid sequences, that are removed from 
their natural environment, isolated or separated, and are at least 60% 
free, preferably 75% free, and most preferably 90% free from other 
components with which they are naturally associated, 

"Stringency" typically occurs in a range from about Tm-5**C (5*C 
below the Tm of the probe) to about 20*'C to 25°C below Tm. As will be 
understood by those of skill in the art, a stringency hybridization can 
be used to identify or detect identical polynucleotide sequences or to 
identify or detect similar or related polynucleotide sequences. 

The term "hybridization" as used herein shall include "any process 
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by which a strand of nucleic acid joins with a complementary strand 
through base pairing" (Coombs J (1994) Pictionarv SiL Biotechnglogy. 
Stockton Press, New York NY) . Amplification as carried out in the 
polymerase chain reaction technologies is described in Dieffenbach CW and 
5 GS Dveksier (1995, ££E Primer . ^ Labggatgry Manudl. Cold Spring Harbor 

Press, Plainview NY) . 
Preferrg j Eabodimftnta 

The present invention relates to novel NSPLP and to the use of the 
nucleic acid and amino acid sequences in the study, diagnosis, prevention 
10 and treatment of disease. cDNAs encoding a portion of NSPLP were found 

in neuronal and endocrine tissue-derived cDNA libraries and in a variety 
of other tissues, including many types of tumors (Tigs. 3A-3E and 4A-4C) . 

The present invention also encompasses NSPLP variants. A preferred 
NSPLP variant is one having at least 80% amino acid sequence similarity 
15 to the NSPLP amino acid sequence (SEQ ID N0:^1) , a more preferred NSPLP 

variant is one having at least 90% amino acid sequence similarity to SEQ 
ID NO: 1 and a most preferred NSPLP variant is one having at least 95% 
amino acid sequence similarity to SEQ ID N0:1. 

Nucleic acids encoding the human NSPLP of the present invention 
20 were first identified in cDNA, Incyte Clones 31870 (SEQ ID NO: 4; THP-1 

cell cDNA library, THPlNOBOl) and 28742 (SEQ ID NO: 9; fetal spleen cDNA 
library, SPLNFETOl), through a computer-generated search for amino acid 
sequence alignments. A consensus sequence, SEQ ID NO: 2, was derived from 
the following overlapping nucleic acid sequences: Incyte Clones 28742 
25 (from cDNA library SPLNFETOl); 45022, 45074, and 45509 (CORNNOTOl) ; 

121581 (MUSCNOTOl); 570122 (MMLR3DT01); and 754150 (BRATUT02; Fig. 5). 
The nucleic acid sequence of SEQ ID NO: 2 encodes the NSPLPA amino acid 
sequence, SEQ ID N0:1. The nucleic acid sequence of SEQ ID NO: 4 encodes 
the NSPLPB amino acid sequence, SEQ ID NO: 3. The nucleic acid sequence 
30 of SEQ ID NO: 4 from residue C„s to T,o« has 97% identity to the partial 

cDNA sequence of clone hbc043 (GI 39104; Bell et al, supra). 

The present invention is based, in part, on the chemical and 
structural homology among NSPLPA, NSPLPB, NSP-A (GI 307307; Roebroek et 
al, supra), NSP-B (GI 307309; Roebroek et al, supra), NSP-C (GI 307311; 
35 Roebroek et al, supra), and rat CI-13 (GI 281046; Wieczorek et al, supra; 

Figs. 6A-D) . NSPLPA and rfSP-C share 66% identity, NSPLPB and NSP-C share 
48% identity, while NSPLPA and NSPLPB share 50% identity. As illustrated 
by Figures 7, 8, and 9, NSPLPA, NSPLPB, and NSP-C have similar 
hydrophobicity plots suggesting similar structure. Like the NSPs, NSPLPA 
40 and NSPLPB have two large hydrophobic regions that could be used for 

membrane attachment. The carboxy-terminal amino acids Lysi,s through Lys.,, 
of NSPLPA precisely match, in position as well as sequence, an ER 
retention motif defined by Jackson et al (1993; supra). The novel NSPLPA 
is 199 amino acids long and has one potential N glycosylation site. The 
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novel NSPLPB is 241 amino acids long. 
n^Q NSPLP Coding SacruTiCQa 

The nucleic acid and deduced amino acid sequences of NSPLP are 
shown in Figures lA, IB, IC, 2A, 2B and 2C. In accordance with the 
invention, any nucleic acid sequence which encodes the amino acid 
sequence of NSPLP can be used to generate recombinant molecules which 
express NSPLP. In a specific embodiment described herein, a nucleotide 
sequence encoding a portion of NSPLP was first isolated as Incyte Clones 
31870 from a THP-1 cell cDNA library (THPlNOBOl) . While, Incyte Clone 
28742 was first isolated from a fetal spleen cDNA library (SPLNFETOl) . 

It will be appreciated by those skilled in the art that as a result 
of the degeneracy of the genetic code, a multitude of NSPLP-encoding 
nucleotide sequences, some bearing minimal homology to the nucleotide 
sequences of any known and naturally occurring gene may be produced. The 
invention contemplates each and every possible variation of nucleotide 
sequence that could be made by selecting combinations based on possible 
codon choices. These combinations are made in accordance with the 
standard triplet genetic code as applied to the nucleotide sequence of 
naturally occurring NSPLP, and all such variations are to be considered 
as being specifically disclosed. 

Although nucleotide sequences which encode NSPLP and its variants 
are preferably capable of hybridizing to the nucleotide sequence of the 
naturally occurring NSPLP under appropriately selected conditions of 
stringency, it may be advantageous to produce nucleotide sequences 
encoding NSPLP or its derivatives possessing a substantially different 
codon usage. Codons may be selected to increase the rate at which 
expression of the peptide occurs in a particular prokaryotic or 
eukaryotic expression host in accordance with the frequency with which 
particular codons are utilized by the host. Other reasons for 
substantially altering the nucleotide sequence encoding NSPLP and its 
derivatives without altering the encoded amino acid sequences include the 
production of RNA transcripts having more desirable properties, such as a 
greater half-life, than transcripts produced from the naturally occurring 
sequence . 

It is now possible to produce a DNA sequence, or portions thereof, 
encoding a NSPLP and its derivatives entirely by synthetic chemistry, 
after which the synthetic gene may be inserted into any of the many 
available DNA vectors and cell systems using reagents that are well known 
in the art at the time of the filing of this application. Moreover, 
synthetic chemistry may be used to introduce mutations into a sequence 
encoding NSPLP or any portion thereof. 

Also included within the scope of the present invention are 
polynucleotide sequences that are capable of hybridizing to the 
nucleotide sequences of figures lA, IB, iC, 2A, 28, and 2C under various 
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conditions of stringency. Hybridization conditions are based on the 
melting temperature (Tm) of the nucleic acid binding complex or probe, as 
taughc in Berger and Kimmel (1987, Guide Molecular Clorjino 
Techniques . Methods ia Enzvmoloqy . Vol 152, Academic Press, San Diego CA) 
5 incorporated herein by reference, and confer may be used at a defined 

stringency. 

Altered nucleic acid sequences encoding NSPLP which may be used in 
accordance with the invention include deletions, insertions or 
substitutions of different nucleotides resulting in a polynucleotide that 

10 encodes the same or a functionally equivalent NSPLP. The protein may 

also show deletions, insertions or substitutions of amino acid residues 
which produce a silent change and result in a functionally equivalent 
NSPLP. Deliberate amino acid substitutions may be made on the basis of 
similarity in polarity, charge, solubility, hydrophobicity, 

15 hydrophilicity, and/or the amphipathic nature of the residues as long as 

the biological activity of NSPLP is retained. For example, negatively 
charged amino acids include aspartic acid and glutamic acid; positively 
charged amino acids include lysine and arginine; and amino acids with 
uncharged polar head groups having similar hydrophilicity values include 

20 leucine, isoleucine, valine; glycine, alanine; asparagine, glutamine; 

serine, threonine phenylalanine, and tyrosine. 

Included within the scope of the present invention are alleles of 
NSPLP. As used herein, an "allele" or ^'allelic sequence" is an 
alternative form of NSPLP. Alleles result from a mutation, ie, a change 

25 in the nucleic acid sequence, and generally produce altered mRNAs or 

polypeptides whose structure or function may or may not be altered. Any 
given gene may have none, one or many allelic forms. Common mutational 
changes which give rise to alleles are generally ascribed to natural 
deletions, additions or substitutions of amino acids. Each of these 

30 types of changes may occur alone, or in combination with the others, one 

or more times in a given sequence. 

Methods for DNA sequencing are well known in the art and employ 
such enzymes as the Klenow fragment of DNA polymerase I, SequenaseO (US 
Biochemical Corp, Cleveland OH)), Taq polymerase {Perkin Elmer, Norwalk 

35 CT), thermostable T7 polymerase (Amersham, Chicago XL), or combinations 

of recombinant polymerases and proofreading exonucleases such as the 
ELONGASE Amplification System marketed by Gibco BRL (Gaithersburg MD) . 
Preferably, the process is automated with machines such as the Hamilton 
Micro Lab 2200 (Hamilton, Reno NV) , Peltier Thermal Cycler <PTC200; MJ 

40 Research, Watertown MA) and the ABI 377 DNA sequencers (Perkin Elmer) . 

The polynucleotide sequence encoding NSPLP may be extended 
utilizing partial nucleotide sequence and various methods known in the 
art to detect upstream sequences such as promoters and regulatory 
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eiemencs. Gobinda et al (1993; PGR Methods Applic 2:318-22) disclose 
"restriction-site" polymerase chain reaction (PGR) as a direct method 
which uses universal primers to retrieve unknown sequence adjacent to a 
known locus. First, genomic DNA is amplified in the presence of primer 
to a linker sequence and a primer specific to the known region. The 
amplified sequences are subjected to a second round of PGR with the same 
linker primer and another specific primer internal to the first one. 
Products of each round of PGR are transcribed with an appropriate RNA 
polymerase and sequenced using reverse transcriptase. 

Inverse PGR can be used to amplify or extend sequences using 
divergent primers based on a known region (Triglia T et al (1988) Nucleic 
Acids Res 16:8166). The primers may be designed using OLIGO® 4.06 Primer 
Analysis Software (1992; National Biosciences Inc, Plymouth MN) , or 
another appropriate program, to be 22-30 nucleotides in length, to have a 
GG content of 50% or more, and to anneal to the target sequence at 
temperatures about 68'*-72** C. The method uses several restriction 
enzymes to generate a suitable fragment in the known region of a gene. 
The fragment is then circularized by intramolecular ligation and used as 
a PGR template. 

Capture PGR (Lagerstrom M et al (1991) PGR Methods Applic 1:111-19) 
is a method for PGR amplification of DNA fragments adjacent to a known 
sequence in human and yeast artificial chromosome DNA. Capture PGR also 
requires multiple restriction enzyme digestions and ligations to place an 
engineered double-stranded sequence into an unknown portion of the DNA 
molecule before PGR. 

Another method which may be used to retrieve unknown sequences is 
that of Parker JD et al (1991; Nucleic Acids Res 19:3055-60). 
Additionally, one can use PGR, nested primers and PromoterFinder 
libraries to walk in genomic DNA (PromoterFinder^ Clontech (Palo Alto 
GA) . This process avoids the need to screen libraries and is useful in 
finding intron/exon junctions. 

Preferred libraries for screening for full length cDNAs are ones 
that have been size-selected to include larger cDNAs, Also, random 
primed libraries are preferred in that they will contain more sequences 
which contain the 5* and upstream regions of genes. A randomly primed 
library may be particularly useful if an oligo d(T) library does not 
yield a full-length cDNA. Genomic libraries are useful for extension 
into the 5' nontranslated regulatory region. 

Capillary electrophoresis may be used to analyze the size or 
confirm the nucleotide sequence of sequencing or PGR products. Systems 
for rapid sequencing are available from Perkin Elmer, Beckman Instruments 
(Fuilerton GA) , and other companies. Capillary sequencing may employ 
flowable polymers for electrophoretic separation, four different 
fluorescent dyes (one for each nucleotide) which are laser activated, and 
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detection of the emitted wavelengths by a charge coupled devise camera. 
Output/light intensity is converted to electrical signal using 
appropriate software {eg. Genotyper'" and Sequence Navigator^ from Perkin 
Elmer) and the entire process from loading of samples to computer 
S analysis and electronic data display is computer controlled. Capillary 

electrophoresis is particularly suited to the sequencing of small pieces 
of DMA which might be present in limited amounts in a particular sample. 
The reproducible sequencing of up to 350 bp of M13 phage DNA in 30 min 
has been reported (Ruiz-Martinez MC et al (1993) Anal Chem 65:2851-2858), 

10 Expreaaio n of the Nuclaotide Sequence 

In accordance with the present invention, polynucleotide sequences 
which encode NSPLP, fragments of the polypeptide, fusion proteins or 
functional equivalents thereof may be used in recombinant DNA molecules 
that direct the expression of NSPLP in appropriate host cells. Due to 

15 the inherent degeneracy of the genetic code^ other DNA sequences which 

encode substantially the same or a functionally equivalent amino acid 
sequence, may be used to clone and express NSPLP. As will be understood 
by those of skill in the art, it may be advantageous to produce 
NSPLP-encoding nucleotide sequences possessing non-nacurally occurring 

20 codons. Codons preferred by a particular prokaryotic or eukaryotic host 

(Murray E et al (1989) Nuc Acids Res 17:477-508) can be selected, for 
example, to increase the rate of NSPLP expression or to produce 
recombinant RNA transcripts having desirable properties, such as a longer 
half-life, than transcripts produced from naturally occurring sequence. 

25 The nucleotide sequences of the present invention can be engineered 

in order to alter a NSPLP coding sequence for a variety of reasons, 
including but not limited to, alterations which modify the cloning, 
processing and/or expression of the gene product. For example, mutations 
may be introduced using techniques which are well known in the art, eg, 

30 site-directed mutagenesis to insert new restriction sites, to alter 

glycosylation patterns, to change codon preference, to produce splice 
variants, etc. 

In another embodiment of the invention, a natural, modified or 
recombinant polynucleotides encoding NSPLP may be ligated to a 
35 heterologous sequence to encode a fusion protein. For example, for 

screening of peptide libraries for inhibitors of NSPLP activity, it may 
be useful to encode a chimeric NSPLP protein that is recognized by a 
commercially available antibody. A fusion protein may also be engineered 
to contain a cleavage site located between a NSPLP sequence and the 
40 heterologous protein sequence, so that the NSPLP may be cleaved and 

purified away from the heterologous moiety. 

In an alternate embodiment of the invention, the coding sequence of 
NSPLP may be synthesized, whole or in part, using chemical methods well 
known in the art (see Caruthers MH et al (1980) Nuc Acids Res Symp Ser 



9 



10 



wo 98/0684 1 PCT/US97/13469 

215-23, Horn T et al(19B0) Nuc Acids Res Symp Ser 225-32, etc). 
Alternatively, the protein itself could be produced using chemical 
methods to synthesize a NSPLP amino acid sequence, whole or in part. For 
example, peptide synthesis can be performed using various solid-phase 
techniques (Roberge JY et al (1995) Science 269:202-204) and automated 
synthesis may be achieved, for example, using the ABI 4 31A Peptide 
Synthesizer (Per kin Elmer) in accordance with the instructions provided 
by the manufacturer. 

The newly synthesized peptide can be substantially by preparative 
high performance liquid chromatography (eg, Creighton (1983) Proteins . 
Structures aad MQl^CUlar PrincinlPs. wH Freeman and Co, New York NY) . 
The composition of the synthetic peptides may be confirmed by amino acid 
analysis or sequencing (eg, the Edman degradation procedure; Creighton, 
supra). Additionally the amino acid sequence of NSPLP, or any part 
thereof, may be altered during direct synthesis and/or combined using 
chemical methods with sequences from other proteins, or any part thereof, 
to produce a variant polypeptide, 

Erogi>ggjLgn gyittna 

In order to express a biologically active NSPLP, the nucleotide 
20 sequence encoding NSPLP or its functional equivalent, is inserted into an 

appropriate expression vector, ie, a vector which contains the necessary 
elements for the transcription and translation of the inserted coding 
sequence . 

Methods which are well known to those skilled in the art can be 
2^ 'Jsed to construct expression vectors containing a NSPLP coding sequence 

and appropriate transcriptional or translational controls. These methods 
include in yitro recombinant DNA techniques, synthetic techniques and in 
YXVQ recombination or genetic recombination. Such techniques are 
described in Sambrook et al (1989) Molecular UjMillfl, Laboratory 
ManiJAl* Cold Spring Harbor Press, Piainview NY and Ausubel FM et al 
(1989) Current PrPtOCQl? in Molecular Biology . John Wiley & sons, New 
York NY. 

A variety of expression vector/host systems may be utilized to 
contain and express a NSPLP coding sequence. These include but are not 
limited to microorganisms such as bacteria transformed with recombinant 
bacteriophage, plasmid or cosmid DNA expression vectors; yeast 
transformed with yeast expression vectors; insect cell systems infected 
with virus expression vectors (eg, baculovirus) ; plant cell systems 
transfected with virus expression vectors (eg, cauliflower mosaic virus, 
CaMV; tobacco mosaic virus, TMV) or transformed with bacterial expression 
vectors (eg, Ti or pBR322 plasmid); or animal cell systems. 

The ^'control elements" or "regulatory sequences" of these systems 
vary in their strength and specificities and are those nontranslated 
regions of the vector, enhancers, promoters, and 3' untranslated regions. 
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which interact with host cellular proteins to carry out transcription and 
translation. Depending on the vector system and host utilized, any 
number of suitable transcription and translation elements, including 
constitutive and inducible promoters, may be used. For example, when 
cloning in bacterial systems, inducible promoters such as the hybrid lacZ 
promoter of the Bluescript® phagemid (Stratagene, LaJolla CA) or pSportl 
{Gibco BRL) and ptrp-iac hybrids and the like may be used. The 
baculovirus polyhedrin promoter may be used in insect cells. Promoters 
or enhancers derived from the genomes of plant cells (eg, heat shock, 
RUBISCO; and storage protein genes) or from plant viruses (eg, viral 
promoters or leader sequences) may be cloned into the vector.. In 
mammalian cell systems, promoters from the mammalian genes or from 
mammalian viruses are most appropriate. If it is necessary to generate a 
cell line that contains multiple copies of NSPLP, vectors based on SV4 0 
or EBV may be used with an appropriate seleqtable marker. 

In bacterial systems, a number of expression vectors may be 
selected depending upon the use intended for NSPLP. For example, when 
large quantities of NSPLP are needed for the induction of antibodies, 
vectors which direct high level expression of fusion proteins that are 
readily purified may be desirable. Such vectors include, but are not 
limited to, the multifunctional £. qqIx cloning and expression vectors 
such as Bluescript® (Stratagene) , in which the NSPLP coding sequence may 
be ligated into the vector in frame with sequences for the amino-terminal 
Met and the subsequent 7 residues of li-galactosidase so that a hybrid 
protein is produced; pIN vectors (Van Heeke & Schuster (1989) J Biol Chem 
264:5503-5509); and the like. pGEX vectors (Promega, Madison WI) may 
also be used to express foreign polypeptides as fusion proteins with 
glutathione S-transferase (GST). In general, such fusion proteins are 
soluble and can easily be purified from iysed cells by adsorption to 
glutathione-agarose beads followed by elution in the presence of free 
glutathione. Proteins made in such systems are designed to include 
heparin, thrombin or factor XA protease cleavage sites so that the cloned 
polypeptide of interest can be released from the GST moiety at will. 

In the yeast, Saccharomvces cerevisiae . a number of vectors 
containing constitutive or inducible promoters such as alpha factor, 
alcohol oxidase and PGH may be used. For reviews, see Ausubel et al 
(supra) and Grant et al (1987) Methods in Enzymology 153:516-544. 

In cases where plant expression vectors are used, the expression of 
a sequence encoding NSPLP may be driven by any of a number of promoters. 
For example, viral promoters such as the 35S and 19S promoters of CaMV 
(Brisson et al (1984) Nature 310:511-514) may be used alone or in 
combination with the omega leader sequence from TMV (Takamatsu et al 
(1987) EMBO J 6:307-311). Alternatively, plant promoters such as the 
small subunit of RUBISCO (Coruzzi et al (1984) EMBO J 3:1671-1680; 
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Brogiie et al (1984) Science 224:839-843); or heat shock promoters 
(Winter J and Sinibaldi RM (1991) Results Probi Ceil Differ 17:85-105) 
may be used. These constructs can be introduced into plant cells by 
direct DNA transformation or pathogen-mediated transf ection . For reviews 
5 of such techniques, see Hobbs S or Murry LE in McGraw Hill Yearbook of 

Science and Technology (1992) McGraw Hill New York NY, pp 191-196 or 
Weissbach and Weissbach (1988) Methods for Plant Molecular Biology . 
Academic Press, New York NY, pp 421-463. 

An alternative expression system which could be used to express 

10 NSPLP is an insect system. In one such system, Autoorapha californi^^^ ^ 

nuclear polyhedrosis virus (AcNPV) is used as a vector to express foreign 
genes in Soodoptera frugioerda cells or in Trichoolusia larvae. The 
NSPLP coding sequence may be cloned into a nonessential region of the 
virus, such as the polyhedrin gene, and placed under control of the 

15 polyhedrin promoter. Successful insertion of-NSPLP will render the 

polyhedrin gene inactive and produce recombinant virus lacking coat 
protein coat. The recombinant viruses are then used to infect ^. 
£rU<jiPSrdfl cells or Trichoolusia larvae in which NSPLP is expressed 
(Smith et al (1983) J Virol 46:584; Engelhard EK et al (1994) Proc Nat 

20 Acad Sci 91:3224-7) . 

In mammalian host cells, a number of viral-based expression systems 
may be utilized. In cases where an adenovirus is used as an expression 
vector, a NSPLP coding sequence may be ligated into an adenovirus 
transcription/translation complex consisting of the late promoter and 

25 tripartite leader sequence. Insertion in a nonessential El or E3 region 

of the viral genome will result in a viable virus capable of expressing 
NSPLP in infected host ceils (Logan and Shenk (1984) Proc Natl Acad Sci 
81:3655-59). Xn addition, transcription enhancers, such as the rous 
sarcoma virus (RSV) enhancer, may be used to increase expression in 

30 mammalian host cells. 

Specific initiation signals may also be required for efficient 
translation of a NSPLP sequence. These signals include the ATG 
initiation codon and adjacent sequences. In cases where NSPLP, its 
initiation codon and upstream sequences are inserted into the appropriate 

35 expression vector, no additional translational control signals may be 

needed. However, in cases where only coding sequence, or a portion 
thereof, is inserted, exogenous transcriptional control signals including 
the ATG initiation codon must be provided. Furthermore, the initiation 
codon must be in the correct reading frame to ensure transcription of the 

40 entire insert. Exogenous transcriptional elements and initiation codons 

can be of various origins, both natural and synthetic. The efficiency of 
expression may be enhanced by the inclusion of enhancers appropriate to 
the cell system in use (Scharf D et al (1994) Results Probi Cell Differ 
20:125-62; Bittner et al (1987) Methods in Enzymol 153:516-54 4). 
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In addition, a hose ceil strain may be chosen for its ability to 
modulate the expression of the inserted sequences or to process the 
expressed protein in the desired fashion. Such modifications of the 
polypeptide include, but are not limited to, acetylation, carboxylation, 
5 glycosylation, phosphorylation, lipidation and acylation. 

Post-translational processing which cleaves a "prepro" form of the 
protein may also be important for correct insertion, folding and/or 
function. Different host cells such as CHO, HeLa, MOCK, 293, WI38, etc 
have specific cellular machinery and characteristic mechanisms for such 

10 post-translational activities and may be chosen to ensure the correct 

modification and processing of the introduced, foreign protein. 

For long-term, high-yield production of recombinant proteins, 
stable expression is preferred. For example, cell lines which stably 
express NSPLP may be transformed using expression vectors which contain 

15 viral origins of replication or endogenous expression elements and a 

selectable marker gene. Following the introduction of the vector, cells 
may be allowed to grow for 1-2 days in an enriched media before they are 
switched to selective media. The purpose of the selectable marker is to 
confer resistance to selection, and its presence allows growth and 

20 recovery of cells which successfully express the introduced sequences. 

Resistant clumps of stably transformed cells can be proliferated using 
tissue culture techniques appropriate to the cell type. 

Any number of selection systems may be used to recover transformed 
cell lines. These include, but are not limited to, the herpes simplex 

25 virus thymidine kinase (Wigler M et al (1977) Cell 11:223-32) and adenine 

phosphoribosyltransferase {Lowy I et al (1980) Cell 22:817-23) genes 
which can be employed in tk- or aprt- cells, respectively. Also, 
antimetabolite, antibiotic or herbicide resistance can be used as the; 
basis for selection; for example, dhfr which confers resistance to 

30 methotrexate {Wigler M et al (1980) Proc Natl Acad Sci 77:3567-70); npt, 

which confers resistance to the aminoglycosides neomycin and G-418 
(Coibere-Garapin F et al (1981) J Mol Biol 150:1-14) and als or pat, 
which confer resistance to chlorsulfuron and phosphinotricin 
acetyltransf erase, respectively (Murry, supra) . Additional selectable 

35 genes have been described, for example, trpB, which allows cells to 

utilize indole in place of ' tryptophan, or hisD, which allows cells to 
utilize histinoi in place of histidine (Hartman SC and RC Mulligan (1988) 
Proc Natl Acad Sci 85:8047-51). Recently, the use of visible markers has 
gained popularity with such markers as anthocyanins, B glucuronidase and 

40 its substrate, GUS, and luciferase and its substrate, luciferin, being 

widely used not only to identify transf ormants, but also to quantify the 
amount of transient or stable protein expression attributable to a 
specific vector system (Rhodes CA et al (1995) Methods Mol Biol 
55:121-131) . 
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Idantificatiion of Transf ormantg Cont:ainin<;T tha PelvnueleotidQ Saouance 

Although the presence/absence of marker gene expression suggests 
that the gene of interest is also present, its presence and expression 
should be confirmed. For example, if the NSPLP is inserted within a 
marker gene sequence, recombinant cells containing NSPLP can be 
identified by the absence of marker gene function. Alternatively, a 
marker gene can be placed in tandem with a NSPLP sequence under the 
control of a single promoter. Expression of the marker gene in response 
to induction or selection usually indicates expression of the tandem 
NSPLP as well. 

Alternatively, host cells which contain the coding sequence for 
NSPLP and express NSPLP may be identified by a variety of procedures 
known to those of skill in the art. These procedures include, but are 
not limited to, DNA-DNA or DNA-RNA hybridization and protein bioassay or 
immunoassay techniques which include membrane, solution, or chip based 
technologies for the detection and/or quantification of the nucleic acid 
or protein. 

The presence of the polynucleotide sequence encoding NSPLP can be 
detected by DNA-DNA or DNA-RNA hybridization or amplification using 
probes, portions or fragments of polynucleotides encoding NSPLP. Nucleic 
acid amplification based assays involve the use of oligonucleotides or 
oligomers based on the NSPLP-encoding sequence to detect transf ormants 
containing DNA or RNA encoding NSPLP. As used herein "oligonucleotides" 
or "oligomers" refer to a nucleic acid sejquence of at least about 10 
nucleotides and as many as about 60 nucleotides, preferably about 15 to 
30 nucleotides, and more preferably about 20-25 nucleotides which can be 
used as a probe or amplimer. 

A variety of protocols for detecting and measuring the expression 
of NSPLP, using either polyclonal or monoclonal antibodies specific for 
the protein are known in the art. Examples include enzyme-linked 
immunosorbent assay (ELISA) , radioimmunoassay (RIA) and fluorescent 
activated cell sorting (FACS) . A two-site, tnonocionai-based immunoassay 
utilizing monoclonal antibodies reactive to two non-interfering epitopes 
on NSPLP is preferred, but a competitive binding assay may be employed. 
These and other assays are described, among other places, in Hampton R et 
al (1990, Serological Methods , 4 Laboratory ManUfli> APS Press, St Paul 
MN) and Maddox DE et al {1983, J Exp Med 158:1211). 

A wide variety of labels and conjugation techniques are known by 
those skilled in the art and can be used in various nucleic acid and 
amino acid assays. Means for producing labeled hybridization or PCR 
probes for detecting sequences related to polynucleotides encoding NSPLP 
include oiigolabeling, nick translation, end-labeling or PCR 
amplification using a labeled nucleotide. Alternatively, the NSPLP 
sequence, or any portion of it, may be cloned into a vector for the 



wo 98/06841 PCT/US97/13469 

production of an mRNA probe. Such vectors are known in the art, are 
commercially available, and may be used to synthesize RNA probes iQ vitro 
by addition of an appropriate RNA polymerase such as T7, T3 or SP6 and 
labeled nucleotides. 

5 A number of companies such as Pharmacia Biotech (Piscataway NJ) , 

Promega (Madison WI), and US Biochemical Corp (Cleveland OH) supply 
commercial kits and protocols for these procedures. Suitable reporter 
molecules or labels include those radionuclides, enzymes, fluorescent, 
chemiluminescent, or chromogenic agents as well as substrates, cofactors, 

10 inhibitors, magnetic particles and the like. Patents teaching the use of 

such labels include US Patents 3,817,837; 3,850,752; 3,939,350; 
3,996,345; 4,277,437; 4,275,149 and 4,366,241. Also, recombinant 
immunoglobulins may be produced as shown in US Patent No. 4,816,567 
incorporated herein by reference. 

15 Purification of WSPLP 

Host cells transformed with a nucleotide sequence encoding NSPLP 
may be cultured under conditions suitable for the expression and recovery 
of the encoded protein from cell culture. The protein produced by a 
recombinant cell may be secreted or contained intracellularly depending 

20 on the sequence and/or the vector used. As will be understood by those 

of skill in the art, expression vectors containing polynucleotides 
encoding NSPLP can be designed with signal sequences which direct 
secretion of NSPLP through a prokaryotic or eukaryotic cell membrane. 
Other recombinant constructions may join NSPLP to nucleotide sequence 

25 encoding a polypeptide domain which will facilitate purification of 

soluble proteins (Kroll DJ et al (1993) DMA Cell Biol 12:441-53; cf 
discussion of vectors infra containing fusion proteins) . 

NSPLP may also be expressed as a recombinant protein with one or 
more additional polypeptide domains added to facilitate protein 

30 purification. Such purification facilitating domains include, but are 

not limited to, metal chelating peptides such as histidine-tryptophan 
modules that allow purification on immobilized metals, protein A domains 
that allow purification on immobilized immunoglobulin, and the domain 
utilized in the FLAGS extension/affinity purification system (Imraunex 

35 Corp, Seattle WA) . The inclusion of a cleavable linker sequences such as 

Factor XA or enterokinase ' (Invitrogen, San Diego CA) between the 
purification domain and NSPLP is useful to facilitate purification. One 
such expression vector provides for expression of a fusion protein 
compromising an NSPLP and contains nucleic acid encoding 6 histidine 

40 residues followed by thioredoxin and an enterokinase cleavage site. The 

histidine residues facilitate purification on IMIAC (immobilized metal 
ion affinity chromotography as described in Porath et al (1992) Protein 
Expression and Purification 3: 263-281) while the enterokinase cleavage 
site provides a means for purifying NSPLP from the fusion protein. 
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In addition to recombinant production, fragments of NSPLP may be 
produced by direct peptide synthesis using solid-phase techniques (cf 
Stewart et al (1969) ^n} id-Phase Peptide Synthesis . WH Freeman Co, San 
Francisco; Merrifield J (1963) J An Chem Soc 85:2149-2154). Xq vitro 
protein synthesis may be performed using manual techniques or by 
automation. Automated synthesis may be achieved, for example, using 
Applied Biosystems 431A Peptide Synthesizer (Perkin Elmer, Foster City 
CA) in accordance with the instructions provided by the manufacturer. 
Various fragments of NSPLP may be chemically synthesized separately and 
combined using chemical methods to produce the full length molecule. 

The rationale for use of the nucleotide and polypeptide sequences 
disclosed herein is based in part on the chemical and structural homology 
among the novel NSPLP proteins disclosed herein, NSP-A (GI 307307; 
Roebroek et al, supra), NSP-B (GI 307309; Roebroek et al, supra), NSP-C 
(GI 307311; Roebroek et al, supra), and rat CI-13 (GI 281046; Wieczorek 
et al, supra) . 

Accordingly, NSPLP or a NSPLP derivative may be used to treat 
cancer and neurodegenerative disorders, such as ALS. In those conditions 
where NSPLP protein activity is not desirable, cells could be transfected 
with antisense sequences of NSPLP-encoding polynucleotides or provided 
with antagonists of NSPLP. 
NSPLP Antibodiaa 

NSPLP-specif ic antibodies are useful for the diagnosis of 
conditions and diseases associated with expression of NSPLP. Such 
antibodies may include, but are not limited to, polyclonal, monoclonal, 
chimeric, single chain, Fab fragments and fragments produced by a Fab 
expression library. Neutralizing antibodies, ie, those which inhibit 
dimer formation, are especially preferred for diagnostics and 
therapeutics. 

NSPLP for antibody induction does not require biological activity; 
however, the protein fragment, or oligopeptide must be antigenic. 
Peptides used to induce specific antibodies may have an amino acid 
sequence consisting of at least five amino acids, preferably at least 10 
amino acids. Preferably, they should mimic a portion of the amino acid 
sequence of the natural protein and may contain the entire amino acid 
sequence of a small, naturally occurring molecule. Short stretches of 
NSPLP amino acids may be fused with those of another protein such as 
keyhole limpet hemocyanin and antibody produced against the chimeric 
molecule. Procedures well known in the art can be used for the 
production of antibodies to NSPLP. 

For the production of antibodies, various hosts including goats, 
rabbits, rats, mice, etc may be immunized by injection with NSPLP or any 
portion, fragment or oligopeptide which retains immunogenic properties. 



16 



wo 98/06841 PCT/US97/13469 

Depending on the host species^ various adjuvants may be used to increase 
immunological response. Such adjuvants include but are not limited to, 
Freund's, mineral gels such as aluminum hydroxide, and surface active 
substances such as lysolecithin, pluronic polyols, polyanions, peptides, 
5 oil emulsions, keyhole limpet hemocyanin, and dinitrophenoi . BCG 

(bacilli Caliuette-Guerin) and Corynebacterium pgrVCT are potentially 
useful human adjuvants. 

Monoclonal antibodies to NSPLP may be prepared using any technique 
which provides for the production of antibody molecules by continuous 

10 cell lines in culture. These include but are not limited to the 

hybridoma technique originally described by Koehler and Milstein (1975 
Nature 256:495-497), the human B-cell hybridoma technique (Kosbor et al 
(1983) Immunol Today 4:72; Cote et al (1983) Proc Natl Acad Sci 
80:2026-2030) and the EBV-hybridoma technique (Cole et al (1985) 

15 Monoclonal Antibodies and Cancer Therapy , Alan R Liss Inc, New York NY, 

pp 77-96) . 

In addition, techniques developed for the production of "chimeric 
antibodies"/ the splicing of mouse antibody genes to human antibody genes 
to obtain a molecule with appropriate antigen specificity and biological 

20 activity can be used (Morrison et al (1984) Proc Natl Acad Sci 

81:6851-6855; Neuberger et al (1984) Nature 312:604-608; Takeda et al 
(1985) Nature 314:452-4 54). Alternatively, techniques described for the 
production of single chain antibodies (US Patent No. 4,946,778) can be 
adapted to produce NSPLP-specif ic single chain antibodies 

25 Antibodies may also be produced by inducing in vivo production in 

the lymphocyte population or by screening recombinant immunoglobulin 
libraries or panels of highly specific binding reagents as disclosed in 
Orlandi et al (1989, Proc Natl Acad Sci 86: 3833-3837), and Winter G and 
Milstein C (1991; Nature 349:293-299). 

30 Antibody fragments which contain specific binding sites for NSPLP 

may also be generated. For example, such fragments include, but are not 
limited to, the F(ab')2 fragments which can be produced by pepsin 
digestion of the antibody molecule and the Fab fragments which can be 
generated by reducing the disulfide bridges of the F(ab')2 fragments. 

35 Alternatively, Fab expression libraries may be constructed to allow rapid 

and easy identification o€ monoclonal Fab fragments with the desired 
specificity (Huse WD et al (1989) Science 256:1275-1281). 

A variety of protocols for competitive binding or immunoradiometric 
assays using either polyclonal or monoclonal antibodies with established 

40 specificities are well known in the art. Such immunoassays typically 

involve the formation of complexes between NSPLP and its specific 
antibody and the measurement of complex formation. A two-site, 
monoclonal-based immunoassay utilizing monoclonal antibodies reactive to 
two noninterf ering epitopes on a specific NSPLP protein is preferred, but 
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a competitive binding assay may also be employed. These assays are 
described in Maddox DE et al (1983, J Exp Med 158:1211). 
Diaanoatic Aaaav Uaina WSPLP Sp^eif AntibgAi^g 

Particular NSPLP antibodies are useful for the diagnosis of 
5 conditions or diseases characterized by expression of NSPLP or in assays 

to monitor patients being treated with NSPLP, agonists or inhibitors. 
Diagnostic assays for NSPLP include methods utilizing the antibody and a 
label to detect NSPLP in human body fluids or extracts of cells or 
tissues. The polypeptides and antibodies of the present invention may be 

10 used with or without modification. Frequently, the polypeptides and 

antibodies will be labeled by joining them, either covalently or 
noncovalently, with a reporter molecule. A wide variety of reporter 
molecules are known, several of which were described above. 

A variety of protocols for measuring NSPLP, using either polyclonal 

15 or monoclonal antibodies specific for the respective protein are known in 

the art. Examples include enzyme-linked immunosorbent assay (ELISA) , 
radioimmunoassay (RIA) and fluorescent activated cell sorting (FACS). A 
two-site, monoclonal-based immunoassay utilizing monoclonal antibodies 
reactive to two non-interfering epitopes on NSPLP is preferred, but a 

20 competitive binding assay may be employed. These assays are described, 

among other places, in Maddox, DE et al (1983, J Exp Med 158:1211). 

In order to provide a basis for diagnosis, normal or standard 
values for NSPLP expression must be established. This is accomplished by 
combining body fluids or cell extracts taken from normal subjects, either 

25 animal or human, with antibody to NSPLP under conditions suitable for 

complex formation which are well known in the art. The amount of 
standard complex formation may be quantified by comparing various 
artificial membranes containing known quantities of NSPLP with both 
control and disease samples from biopsied tissues. Then, standard values 

30 obtained from normal samples may be compared with values obtained from 

samples from subjects potentially affected by disease. Deviation between 
standard and subject values establishes the presence of disease state. 

Drag gggMninq 

NSPLP, its catalytic or immunogenic fragments or oligopeptides 
35 thereof, can be used for screening therapeutic compounds in any of a 

variety of drug screening techniques. The fragment employed in such a 
test may be free in solution, affixed to a solid support, borne on a eel* 
surface, or located intracellularly . The formation of binding complexes, 
between NSPLP and the agent being tested, may be measured. 
40 Another technique for drug screening which may be used provides for 

high throughput screening of compounds having suitable binding affinity 
to the NSPLP is described in detail in '^Determination of Amino Acid 
Sequence Antigenicity" by Geysen HN, WO Application 84/03564, published 
on September 13, 1984, and incorporated herein by reference. In summary, 
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large numbers of different small peptide test compounds are synthesized 
on a solid substrate, such as plastic pins or some other surface. The 
peptide test compounds are reacted with fragments of NSPLP and washed. 
Bound NSPLP is then detected by methods well known in the art. Purified 
5 NSPLP can also be coated directly onto plates for use in the 

aforementioned drug screening techniques. Alternatively, 
non-neutralizing antibodies can be used to capture the peptide and 
immobilize it on a solid support. 

This invention also contemplates the use of competitive drug 

10 screening assays in which neutralizing antibodies capable of binding 

NSPLP specifically compete with a test compound for binding NSPLP. In 
this manner, the antibodies can be used to detect the presence of any 
peptide which shares one or more antigenic determinants with NSPLP. 
Uses of th& PolvnueleotidQ Encoding NSPLP 

15 A polynucleotide encoding NSPLP, or any part thereof, may be used 

for diagnostic and/or therapeutic purposes. For diagnostic purposes, 
polynucleotides encoding NSPLP of this invention may be used to detect 
and quantitate gene expression in biopsied tissues in which expression of 
NSPLP may be implicated. The diagnostic assay is useful to distinguish 

20 between absence, presence, and excess expression of NSPLP and to monitor 

regulation of NSPLP levels during therapeutic intervention. Included in 
the scope of the invention are oligonucleotide sequences, antisense RNA 
and DNA molecules, and PNAs . 

Another aspect of the subject invention is to provide for 

25 hybridization or PCR probes which are capable of detecting polynucleotide 

sequences, including genomic sequences, encoding NSPLP or closely related 
molecules. The specificity of the probe, whether it is made from a 
highly specific region, eg, 10 unique nucleotides in the 5' regulatory 
region, or a less specific region, eg, especially in the 3' region, and 

30 the stringency of the hybridization or amplification (maximal, high, 

intermediate or low) will determine whether the probe identifies only 
naturally occurring sequences encoding NSPLP, alleles or related 
sequences . 

Probes may also be used for the detection of related sequences and 
35 should preferably contain at least 50% of the nucleotides from any of 

these NSPLP encoding sequences. The hybridization probes of the subject 
invention may be derived from the nucleotide sequence of SEQ ID NO: 2 or 
from genomic sequence including promoter, enhancer elements and introns 
of the naturally occurring NSPLP. Hybridization probes may be labeled by 
40 a variety of reporter groups, including radionuclides such as 32P or 35S, 

or enzymatic labels such as alkaline phosphatase coupled to the probe via 
avidin/biotin coupling systems, and the like. 

Other means for producing specific hybridization probes for DNAs 
encoding NSPLP include the cloning of nucleic acid sequences encoding 
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NSPLP or NSPLP derivatives into vectors for the production of mRNA 
probes. Such vectors are known in the art and are commercially available 
and may be used to synthesize RNA probes in vitro by means of the 
addition of the appropriate RNA polymerase as Tl or SP6 RNA polymerase 
and the appropriate radioactively labeled nucleotides. 

Polynucleotide sequences encoding NSPLP may be used for the 
diagnosis of conditions or diseases with which the expression of NSPLP is 
associated. For example, polynucleotide sequences encoding NSPLP may be 
used in hybridization or PCR assays of fluids or tissues from biopsies to 
detect NSPLP expression. The form of such qualitative or quantitative 
methods may include Southern or northern analysis, dot blot or other 
membrane-based technologies; PCR technologies; dip stick, pXN, chip and 
ELISA technologies. All of these techniques are well known in the art 
and are the basis of many commercially available diagnostic kits. 

The nucleotide sequences encoding NSPLP^disclosed herein provide 
the basis for assays that detect activation or induction associated with 
cancer and neurodegenerative disorders, such as ALS . The nucleotide 
sequence encoding NSPLP may be labeled by methods known in the art and 
added to a fluid or tissue sample from a patient under conditions 
suitable for the formation of hybridization complexes. After an 
incubation period, the sample is washed with a compatible fluid which 
optionally contains a dye (or other label requiring a developer) if the 
nucleotide has been labeled with an enzyme. After the compatible fluid 
is rinsed off, the dye is quantitated and compared with a standard. If 
the amount of dye in the biopsied or extracted sample is significantly 
elevated over that of a comparable control sample, the nucleotide 
sequence has hybridized with nucleotide sequences in the sample, and the 
presence of elevated levels of nucleotide sequences encoding NSPLP in the 
sample indicates the presence of the associated disease. 

Such assays may also be used to evaluate the efficacy of a 
particular therapeutic treatment regime in animal studies, in clinical 
trials, or in monitoring the treatment of an individual patient. In 
order to provide a basis for the diagnosis of disease, a normal or 
standard profile for NSPLP expression must be established. This is 
accomplished by combining body fluids or cell extracts taken from normal 
subjects, either animal or human, with NSPLP, or a portion thereof, under 
conditions suitable for hybridization or amplification. Standard 
hybridization may be quantified by comparing the values obtained for 
normal subjects with a dilution series of NSPLP run in the same 
experiment where a known amount of a substantially purified NSPLP is 
used. Standard values obtained from normal samples may be compared with 
values obtained from samples from patients afflicted with 

NSPLP-associated diseases. Deviation between standard and subject values 
is used to establish the presence of disease. 
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Once disease is established, a therapeutic agent is administered 
and a treatment profile is generated. Such assays may be repeated on a 
regular basis to evaluate whether the values in the profile progress 
toward or return to the normal or standard pattern. Successive treatment 
5 profiles may be used to show the efficacy of treatment over a period of 

several days or several months. 

PGR, as described in US Patent Nos. 4,683,195 and 4,965,188, 
provides additional uses for oligonucleotides based upon the NSPLP 
sequence. Such oligomers are generally chemically synthesized, but they 
10 may be generated enzymatically or produced from a recombinant source. 

Oligomers generally comprise two nucleotide sequences, one with sense 
orientation (5'->3') and one with antisense (3'<-5'), employed under 
optimized conditions for identification of a specific gene or condition. 
The same two oligomers, nested sets of oligomers, or even a degenerate 
15 pool of oligomers may be employed under less stringent conditions for 

detection and/or quantitation of closely related DNA or RNA sequences. 

Additionally, methods which may be used to quantitate the 
expression of a particular molecule include radioiabeling (Melby PC at ai 
1993 J Immunol Methods 159:235-44) or biotinylat ing (Duplaa C et al 1993 
20 Anal Biochem 229-36) nucleotides, coamplif ication of a control nucleic 

acid, and standard curves onto which the experimental results are 
interpolated. Quantitation of multiple samples may be speeded up by 
running the assay in an ELISA format where the oligomer of interest is 
presented in various dilutions and a spectrophotometric or colorimetrtc 
25 response gives rapid quantitation. For example, the presence of a 

relatively high amount of NSPLP in extracts of biopsied tissues may 
indicate the onset of cancer. A definitive diagnosis of this type may 
allow health professionals to begin aggressive treatment and prevent 
further worsening of the condition. Similarly, further assays can be 
30 used to monitor the progress of a patient during treatment. Furthermore, 

the nucleotide sequences disclosed herein may be used in molecular 
biology techniques that have not yet been developed, provided the new 
techniques rely on properties of nucleotide sequences that are currently 
)cnown such as the triplet genetic code, specific base pair interactions, 
35 and the like. 

Based upon its homology to genes encoding NSP-iike proteins and its 
expression profile, polynucleotide sequences encoding NSPLP disclosed 
herein may be useful in the treatment of conditions such as cancer and 
40 neurodegenerative disorders, such as ALS. 

Expression vectors derived from retroviruses, adenovirus, herpes or 
vaccinia viruses, or from various bacterial plasmids, may be used for 
delivery of nucleotide sequences to the targeted organ, tissue or cell 
population. Methods which are well known to those skilled in the art can 
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be used to construct recombiaant vectors which will express antisense 
polynucleotides of the gene encoding NSPLP. See, for example, the 
techniques described in Sambrook et al (supra) and Ausubel et al (supra) . 
The polynucleotides comprising full length cDNA sequence and/or its 
5 regulatory elements enable researchers to use sequences encoding NSPLP as 

an investigative tool in sense (Youssoufian H and HF Lodish 1993 Mol Cell 
Biol 13:98-104) or antisense (Eguchi et al (1991) Annu Rev Biochem 
60:631-652) regulation of gene function. Such technology is now well 
known in the art, and sense or antisense oligomers, or larger fragments, 
10 can be designed from various locations along the coding or control 

regions. 

Genes encoding NSPLP can be turned off by transfecting a cell or 
tissue with expression vectors which express high levels of a desired 
NSPLP-encoding fragment. Such constructs can flood cells with 

15 untranslatable sense or antisense sequences. *Even in the absence of 

integration into the DNA, such vectors may continue to transcribe RNA 
molecules until all copies are disabled by endogenous nucleases. 
Transient expression may last for a month or more with a non-replicating 
vector (Mettler I, personal communication) and even longer if appropriate 

20 replication elements are part of the vector system. 

As mentioned above, modifications of gene expression can be 
obtained by designing antisense molecules, DNA, RNA or PNA, to the 
control regions of gene encoding NSPLP, ie, the promoters, enhancers, and 
introns. Oligonucleotides derived from the transcription initiation 

25 site, eg, between -10 and +10 regions of the leader sequence, are 

preferred. The antisense molecules may also be designed to block 
translation of mRNA by preventing the transcript from binding to 
ribosomes. Similarly, inhibition can be achieved using "triple helix" 
base-pairing methodology. Triple helix pairing compromises the ability 

30 of the double helix to open sufficiently for the binding of polymerases, 

transcription factors, or regulatory molecules. Recent therapeutic 
advances using triplex DNA were reviewed by Gee JE et al (In: Huber BE 
and BI Carr (1994) Molecular and Immunologic Approaches. Futura 
Publishing Co, Mt Kisco NY) . 

35 Ribozymes are enzymatic RNA molecules capable of catalyzing the 

specific cleavage of RNA. The mechanism of ribozyme action involves 
sequence-specific hybridization of the ribozyme molecule to complementary 
target RNA, followed by endonucleolytic cleavage. Within the scope of 
the invention are engineered hammerhead motif ribozyme molecules that can 

40 specifically and efficiently catalyze endonucleolytic cleavage of 

sequences encoding NSPLP. 

Specific ribozyme cleavage sites within any potential RNA target 
are initially identified by scanning the target molecule for ribozyme 
cleavage sites which include the following sequences, GUA, GUU and GUC. 
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Once identified, short RNA sequences of between 15 and 20 ribonucleotides 
corresponding to the region of the target gene containing the cleavage 
site may be evaluated for secondary structural features which may render 
the oligonucleotide inoperable. The suitability of candidate targets may 
5 also be evaluated by testing accessibility to hybridization with 

complementary oligonucleotides using ribonuclease protection assays. 

Antisense molecules and ribozymes of the invention may be prepared 
by any method known in the art for the synthesis of RNA molecules. These 
include techniques for chemically synthesizing oligonucleotides such as 

10 solid phase phosphoramidite chemical synthesis. Alternatively, RNA 

molecules may be generated by ia vitro and in vivo transcription of DNA 
sequences encoding NSPLP. Such DNA sequences may be incorporated into a 
wide variety of vectors with suitable RNA polymerase promoters such as T7 
or SP6. Alternatively, antisense cDNA constructs that synthesize 

15 antisense RNA constitutively or inducibly can be introduced into cell 

lines, cells or tissues. 

RNA molecules may be modified to increase intracellular stability 
and half-life. Possible modifications include, but are not limited to, 
the addition of flanking sequences at the 5' and/or 3' ends of the 

20 molecule or the use of phosphorothioate or 2' O-methyl rather than 

phosphodiesterase linkages within the backbone of the molecule. This 
concept is inherent in the production of PNAs and can be extended in all 
of these molecules by the inclusion of nontraditional bases such as 
inosine, queosine and wybutosine as well as acetyl-, methyl-, thio- and 

25 similarly modified forms of adenine, cytidine, guanine, thymine, and 

uridine which are not as easily recognized by endogenous endonucleases . 

Methods for introducing vectors into cells or tissues include those 
methods discussed infra and which are equally suitable for in vivo . In 
vitro and qz. vivQ therapy. For £ii vivo therapy, vectors are introduced 

30 into stem cells taken from the patient and clonaliy propagated for 

autologous transplant back into that same patient is presented in US 
Patent Nos . 5,399,493 and 5,437,994, disclosed herein by reference. 
Delivery by transfection and by liposome are quite well known in the art. 
Furthermore, the nucleotide sequences for NSPLP disclosed herein 

35 may be used in molecular biology techniques that have not yet been 

developed, provided the nfew techniques rely on properties of nucleotide 
sequences that are currently known, including but not limited to such 
properties as the triplet genetic code and specific base pair 
interactions . 

40 DQtoetion and tdappino of Related Polvnucleotide Seouencea 

The nucleic acid sequence for NSPLP can also be used to generate 
hybridization probes for mapping the naturally occurring genomic 
sequence. The sequence may be mapped to a particular chromosome or to a 
specific region of the chromosome using well known techniques. These 



23 



CI loe-nxi iTC cuccx mi tl P 



wo 98/06841 PCT/US97/13469 

include in situ hybridization to chromosomal spreads, flow-sorted 
chromosomal preparations, or artificial chromosome constructions such as 
yeast artificial chromosomes, bacterial artificial chromosomes, bacterial 
PI constructions or single chromosome cDNA libraries as reviewed in Price 
CM (1993; Blood Rev 7:127-34) and Trask BJ (1991; Trends Genet 7:149-54). 

The technique of fluorescent Iq situ hybridization of chromosome 
spreads has been described, among other places, in Verma et ai (19881 
iimadl ChromoSgrngS : h tl^nu^ OL ^^SlS. Techniques. Pergamoa Press, New 
York NY. Fluorescent in situ hybridization of chromosomal preparations 
and other physical chromosome mapping techniques may be correlated with 
additional genetic map data. Examples of genetic map data can be found 
in the 1994 Genome Issue of Science (265:1981f). Correlation between the 
location of the gene encoding NSPLP on a physical chromosomal map and a 
specific disease (or predisposition to a specific disease) may help 
delimit the region of DNA associated with that genetic disease. The 
nucleotide sequences of the subject invention may be used to detect 
differences in gene sequences between normal, carrier or affected 
individuals . 

In situ hybridization of chromosomal preparations and physical 
mapping techniques such as linkage analysis using established chromosomal 
markers may be used for extending genetic maps. For example an sequence 
tagged site based map of the human genome was recently published by the 
Whitehead-MIT Center for Genomic Research (Hudson TJ et al(1995) Science 
270:194 5-1954). Often the placement of a gene on the chromosome of 
another mammalian species such as mouse (Whitehead Institute/MIT Center 
for Genome Research, Genetic Map of the Mouse, Database Release 10, April 
28, 1995) may reveal associated markers even if the number or arm of a 
particular human chromosome is not known. New sequences can be assigned 
to chromosomal arms, or parts thereof, by physical mapping. This 
provides valuable information to investigators searching for disease 
genes using positional cloning or other gene discovery techniques. Once 
a disease or syndrome, such as ataxia telangiectasia (AT), has been 
crudely localized by genetic linkage to a particular genomic region, for 
example, AT to llq22-23 (Gatti et al (1988) Nature 336:577-580), any 
sequences mapping to that area may represent associated or regulatory 
genes for further investigation. The nucleotide sequence of the subject 
invention may also be used to detect differences in the chromosomal 
location due to translocation, inversion, etc. among normal, carrier or 
affected individuals. 
Pharmaceutical Compositions 

The present invention relates to pharmaceutical compositions which 
may comprise nucleotides, proteins, antibodies, agonists, antagonists, or 
inhibitors, alone or in combination with at least one other agent, such 
as stabilizing compound, which may be administered in any sterile, 
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biocompatible pharmaceutical carrier, including, but not limited to, 
saline, buffered saline, dextrose, and water. Any of these molecules can 
be administered to a patient alone, or in combination with- other agents, 
drugs or hormones, in pharmaceutical compositions where it is mixed with 
5 excipient(s) or pharmaceutically acceptable carriers. In one embodiment 

of the present invention, the pharmaceutically acceptable carrier is 
pharmaceutically inert. 

Adainistratiion of Pharmaceutical Compositions 

Administration of pharmaceutical compositions is accomplished 

10 orally or parenterally . Methods of parenteral delivery include topical, 

intra-arterial {directly to the tumor), intramuscular, subcutaneous, 
intramedullary, intrathecal, intraventricular, intravenous, 
intraperitoneal, or intranasal administration. In addition to the active 
ingredients, these pharmaceutical compositions may contain suitable 

15 pharmaceutically acceptable carriers comprising excipients and 

auxiliaries which facilitate processing of the active compounds into 
preparations which can be used pharmaceutically. Further details on 
techniques for formulation and administration may be found in the latest 
edition of ''Remington ' s Pharmaceutical Sciences" (Maack Publishing Co, 

20 Easton PA) . 

Pharmaceutical compositions for oral administration can be 
formulated using pharmaceutically acceptable carriers well known in the 
art in dosages suitable for oral administration. Such carriers enable 
the pharmaceutical compositions to be formulated as tablets, pills, 

25 dragees, capsules, liquids, gels, syrups, slurries, suspensions and the 

like, for ingestion by the patient. 

Pharmaceutical preparations for oral use can be obtained through 
combination of active compounds with solid excipient, optionally grinding 
a resulting mixture, and processing the mixture of granules, after adding 

30 suitable auxiliaries, if desired, to obtain tablets or dragee cores. 

Suitable excipients are carbohydrate or protein fillers such as sugars, 
including lactose, sucrose, mannitoi, or sorbitol; starch from corn, 
wheat, rice, potato, or other plants; cellulose such as methyl cellulose, 
hydroxypropylmethyl-cellulose, or sodium carboxymethylcellulose; and gums 

35 including arable and tragacanth; and proteins such as gelatin and 

collagen. If desired, disintegrating or solubilizing agents may be 
added, such as the cross-linked polyvinyl pyrrolidone, agar, alginic 
acid, or a salt thereof, such as sodium alginate. 

Dragee cores are provided with suitable coatings such as 

40 concentrated sugar solutions, which may also contain gum arable, talc, 

polyvinylpyrrolidone, carbopol gel, polyethylene glycol, and/or titanium 
dioxide, lacquer solutions, and suitable organic solvents or solvent 
mixtures, Dyestuffs or pigments may be added to the tablets or dragee 
coatings for product identification or to characterize the quantity of 
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active compound, ie, dosage. 

Pharmaceutical preparations which can be used orally include 
push-fit capsules made of gelatin, as well as soft, sealed capsules made 
of gelatin and a coating such as glycerol or sorbitol. Push-fit capsules 
can contain active ingredients mixed with a filler or binders such as 
lactose or starches, lubricants such as talc or magnesium stearate, and, 
optionally, stabilizers. In soft capsules, the active compounds may be 
dissolved or suspended in suitable liquids, such as fatty oils, liquid 
paraffin, or liquid polyethylene glycol with or without stabilizers. 

Pharmaceutical formulations for parenteral administration include 
aqueous solutions of active compounds. For injection, the pharmaceutical 
compositions of the invention may be formulated in aqueous solutions, 
preferably in physiologically compatible buffers such as Hanks 's 
solution. Ringer's solution, or physiologically buffered saline. Aqueous 
injection suspensions may contain substances yrhich increase the viscosity 
of the suspension, such as sodium carboxymethyl cellulose, sorbitol, or 
dextran. Additionally, suspensions of the active compounds may be 
prepared as appropriate oily injection suspensions. Suitable lipophilic 
solvents or vehicles include fatty oils such as sesame oil, or synthetic 
fatty acid esters, such as ethyl oleate or triglycerides, or liposomes. 
Optionally, the suspension may also contain suitable stabilizers or 
agents which increase the solubility of the compounds to allow for the 
preparation of highly concentrated solutions. 

For topical or nasal administration, penetrants appropriate to the 
particular barrier to be permeated are used in the formulation. Such 
penetrants are generally known in the art. 
Manufaefcufa and Storaoo 

The pharmaceutical compositions of the present invention may be 
manufactured in a manner that known in the art, eg, by means of 
conventional mixing, dissolving, granulating, dragee-making, levigating, 
emulsifying, encapsulating, entrapping or iyophilizing processes. 

The pharmaceutical composition may be provided as a salt and can be 
formed with many acids, including but not limited to hydrochloric, 
sulfuric, acetic, lactic, tartaric, malic, succinic, etc. Salts tend to 
be more soluble in aqueous or other protonic solvents that are the 
corresponding free base forms. In other cases, the preferred preparation 
may be a iyophilized powder in lmM-50 mM histidine, 0.1%-2% sucrose, 
2%-7% mannitol at a pH range of 4.5 to 5.5 that is combined with buffer 
prior to use. 

After pharmaceutical compositions comprising a compound of the 
invention formulated in a acceptable carrier have been prepared, they can 
be placed in an appropriate container and labeled for treatment of an 
indicated condition. For administration of NSPLP, such labeling would 
include amount, frequency and method of administration. 
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ThQrapeut;ir.*llv Effective Doge 

Pharmaceutical compositions suitable for use in the present 
invention include compositions wherein the active ingredients are 
contained in an effective amount to achieve the intended purpose. The 
5 determination of an effective dose is well within the capability of those 

skilled in the art. 

For any compound, the therapeutically effective dose can be 
estimated initially either in ceil culture assays, eg, of neoplastic 
cells, or in animal models, usually mice, rabbits, dogs, or pigs. The 

10 animal model is also used to achieve a desirable concentration range and 

route of administration. Such information can then be used to determine 
useful doses and routes for administration in humans. 

A therapeutically effective dose refers to that amount of protein 
or its antibodies, antagonists, or inhibitors which ameliorate the 

15 symptoms or condition. Therapeutic efficacy and toxicity of such 

compounds can be determined by standard pharmaceutical procedures in ceil 
cultures or experimental animals, eg, ED50 (the dose therapeutically 
effective in 50% of the population) and LD50 (the dose lethal to 50% of 
the population). The dose ratio between therapeutic and toxic effects is 

20 the therapeutic index, and it can be expressed as the ratio, LD50/ED50. 

Pharmaceutical compositions which exhibit large therapeutic indices are 
preferred. The data obtained from cell culture assays and animal studies 
is used in formulating a range of dosage for human use. The dosage of 
such compounds lies preferably within a range of circulating 

25 concentrations that include the ED50 with little or no toxicity. The 

dosage varies within this range depending upon the dosage form employed, 
sensitivity of the patient, and the route of administration. 

The exact dosage is chosen by the individual physician in view of 
the patient to be treated. Dosage and administration are adjusted to 

30 provide sufficient levels of the active moiety or to maintain the desired 

effect. Additional factors which may be taken into account include the 
severity of the disease state, eg, tumor size and location; age, weight 
and gender of the patient; diet, time and frequency of administration, 
drug combination (s) , reaction sensitivities, and tolerance/response to 

35 therapy. Long acting pharmaceutical compositions might be administered 

every 3 to 4 days, every v^eek, or once every two weeks depending on 
half-life and clearance rate of the particular formulation. 

Normal dosage amounts may vary from 0.1 to 100,000 micrograms, up 
to a total dose of about 1 g, depending upon the route of administration. 

40 Guidance as to particular dosages and methods of delivery is provided in 

the literature and generally available to practitioners in the art. 
Those skilled in the art will employ different formulations for 
nucleotides than for proteins or their inhibitors. Similarly, delivery 
of polynucleotides or polypeptides will be specific to particular cells. 
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conditions, locations, etc. 

It is contemplated, for example, that NSPLP or an NSPLP derivative 
can be delivered in a suitable formulation to block the progression of 
cancerous cell growth or of neuronal degeneration. Similarly, 
administration of NSPLP antagonists may also inhibit the activity or 
shorten the lifespan of this protein. 

The examples below are provided to illustrate the subject invention 
and are not included for the purpose of limiting the invention. 

INDUSTRIAL APPLICABILITY 
I Confltniction of cDNA Librariea 

THP-1 

THP-1 is a human leukemic cell line derived from the blood of a 1- 
year-old boy with acute monocytic leukemia. The THP-1 cells represent 
monocytes. The THP-1 cDNA library was custom constructed by Stratagene 
(Stratagene, 11099 M. Torrey Pines Rd,, La Jolla, CA 92037) essentially 
as described below. 

Stratagene prepared the cDNA library using oligo d(T) priming. 
Synthetic adapter oligonucleotides were ligated onto the cDNA molecules 
enabling them to be inserted into the Uni-ZAP* vector system 
(Stratagene) . This allowed high efficiency unidirectional (sense 
orientation) lambda library construction and the convenience of a piasmid 
system with blue/white color selection to detect clones with cDNA 
insertions . 

The quality of the cDNA library was screened using DNA probes, and 
then, the pBlue script® phagemid (Stratagene) was excised. This phagemid 
allows the use of a piasmid system for easy insert characterization, 
sequencing, site-directed mutagenesis, the creation of unidirectional 
deletions and expression of fusion polypeptides. Subsequently, the 
custom-constructed library phage particles were infected into E. coli 
host strain XLl-Blue® (Stratagene) . The high transformation efficiency 
of this bacterial strain increases the probability that the cDNA library 
will contain rare, under-represented clones. Alternative unidirectional 
vectors include, but are not limited to, pcDNAI (Invitrogen, San Diego 
CA) and pSHlox-1 (Novagen, Madison WI) . 

The human spleen cell cDNA library was custom constructed by 
Stratagene (catalogue # 937205. Stratagene, La Jolla CA) . The starting 
cell population is mixed, having been obtained from fetal spleens which 
have a diverse cell population. Furthermore, the fetal spleens have been 
pooled from different sources. Poly{A+) RNA (mRNA) was purified from the 
spleen cells. cDNA was synthesized from the mRNA. Synthetic adaptor 
oligonucleotides were ligated onto cDNA ends enabling its insertion into 
Uni-ZAP* vector system (Stratagene), allowing high efficiency 
unidirectional (sense orientation) lambda library construction and the 
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convenience of a plasmid system with blue/white color selection to detect 
clones with cDNA insertions. Alternative unidirectional vectors are 
pcDNAl (invitrogen, San Diego CAl and pSHiox-1 (Novagen, Madison WI). 
IX IitQlation of cDKA Clonea 

The phagemid forms of individual cDNA clones were obtained by the 
in vivo excision process, in which the host bacterial strain was co- 
infected with both the library phage and an fl helper phage. 
Polypeptides or enzymes derived from both the library-containing phage 
and the helper phage nicked the DNA, initiated new DNA synthesis from 
defined sequences on the target DNA, and created a smaller, single 
stranded circular phagemid DNA molecule that included all DNA sequences 
of the pBluescript phagemid and the cDNA insert. The phagemid DNA was 
released from the cells and purified, and used to reinfect fresh host 
cells (SOLR, Stratagene) where double-stranded phagemid DNA was produced. 
Because the phagemid carries the gene for p-lactamase, the newly 
transformed bacteria were selected on medium containing ampicillin. 

An alternate method of purifying phagemid has recently become 
available. It utilizes the Miniprep Kit (Catalog No. 77468, available 
from Advanced Genetic Technologies Corp., 19212 Orbit Drive, 
Gaithersburg, Maryland) . This kit is in the 96-well format and provides 
enough reagents for 960 purifications. Each kit is provided with a 
recommended protocol, which has been employed except for the following 
changes. First, the 96 wells are each filled with only 1 ml of sterile 
terrific broth with carbenicillin at 25 mg/L and glycerol at 0.4%. After 
the wells are inoculated, the bacteria are cultured for 24 hours and 
lysed with 60 ^1 of lysis buffer. A cent rif ugation step (2900 rpm for 5 
minutes) is performed before the contents of the block are added co the 
primary filter plate. The optional step of adding isopropanol to TRIS 
buffer is not routinely performed. After the last step in the protocol, 
samples are transferred to a Beckman 96-weli block for storage. 

Phagemid DNA was also purified using the QIAWELL-8 Plasmid 
Purification System from the QIAGEN® DNA Purification System (QIAGEN Inc, 
Chatsworth CA) . This product provides a convenient, rapid and reliable 
high-throughput method for lysing the bacterial cells and isolating 
highly purified phagemid CNA using QIAGEN anion-exchange resin particles 
with EMPORE™ membrane technology from 3M in a multiwell format. The DNA 
was eluted from the purification resin and prepared for DNA sequencing 

and other analytical manipulations. 
Fatal aplQ^n 

The phagemid forms of individual cDNA clones were obtained by the 
in vivo excision process, in which the host bacterial strain was co- 
infected with both the library phage and an fl helper phage. 
Polypeptides or enzymes derived from both the library-containing phage 
and the helper phage nicked the DNA, initiated new DNA synthesis from 
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defined sequences on the target DNA, and created a smaller, single 
stranded circular phagemid DNA molecule that included all DNA sequences 
of the pBluescript phagemid and the cDNA insert. The phagemid DNA was 
released from the ceils and purified, and used to reinfect fresh host 
cells (SOLR, Stratagene) where double-stranded phagemid DNA was produced. 
Because the phagemid carries the gene for ^-lactamase, the newly 
transformed bacteria were selected on medium containing ampicillin. 

Phagemid DNA was also purified using the QIAWELL-8 Plasmid 
Purification System from the QIAGEN® DNA Purification System (QIAGEN Inc, 
Chatsworth CA) . This product provides a convenient, rapid and reliable 
high-throughput method for lysing the bacterial cells and isolating 
highly purified phagemid DNA using QIAGEN anion-exchange resin particles 
with EMPORE^** membrane technology from 3M in a multiwell format. The DNA 
was eluted from the purification resin and prepared for DNA sequencing 
and other analytical manipulations. 

Ill HomQloov Saarchina of cDNA Clonas mnd Thoir DaducQd Protmina 

Each cDNA was compared to sequences in GenBank using a search 
algorithm developed by Applied Biosystems and incorporated into the 
INHERIT" 670 Sequence Analysis System. In this algorithm, Pattern 
Specification Language (TRW Inc, Los Angeles CA) was used to determine 
regions of homology. The three parameters that determine how the 
sequence comparisons run were window size, window offset, and error 
tolerance. Using a combination of these three parameters, the DNA 
database was searched for sequences containing regions of homology to the 
query sequence, and the appropriate sequences were scored with an initial 
value. Subsequently, these homologous regions were examined using dot 
matrix homology plots to distinguish regions of homology from chance 
matches. Smith-Waterman alignments were used to display the results of 
the homology search. 

Peptide and protein sequence homologies were ascertained using the 
INHERIT- 670 Sequence Analysis System in a way similar to that used in 
DNA sequence homologies. Pattern Specification Language and parameter 
windows were used to search protein databases for sequences containing 
regions of homology which were scored with an initial value. Dot -matrix 
homology plots were examined to distinguish regions of significant 
homology from chance matches. 

BLAST, which stands for Basic Local Alignment Search Tool (Altschul 
SF (1993) J Mol Evol 36:290-300; Altschul, SF et al (1990) J Mol Biol 
215:403-10), was used to search for local sequence alignments. BLAST 
produces alignments of both nucleotide and amino acid sequences to 
determine sequence similarity. Because of the local nature of the 
alignments, BLAST is especially useful in determining exact matches or in 
identifying homologs. BLAST is useful for matches which do not contain 
gaps. The fundamental unit of BLAST algorithm output is the High-scoring 
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Segment Pair (HSP) . 

An HSP consists of two sequence fragments of arbitrary but equal 
lengths whose alignment is locally maximal and for which the alignment 
score meets or exceeds a threshold or cutoff score set by the user. The 
BLAST approach is to look for HSPs between a query sequence and a 
database sequence, to evaluate the statistical significance of any 
matches found, and to report only those matches which satisfy the 
user-selected threshold of significance. The parameter E establishes the 
statistically significant threshold for reporting database sequence 
matches. E is interpreted as the upper bound of the expected frequency 
of chance occurrence of an HSP (or set of HSPs) within the context of the 
entire database search. Any database sequence whose match satisfies E is 
reported in the program output. 

IV r^9rthgyn Analyga-g 

Northern analysis is a laboratory technique used to detect the 
presence of a transcript of a gene and involves the hybridization of a 
labelled nucleotide sequence to a membrane on which RNAs from a 
particular ceil type or tissue have been bound (Sambrook et al . supra). 

Analogous computer techniques using BLAST (Altschul SF 1993 and 
1990, supra) are used to search for identical or related molecules in 
nucleotide databases such as GenBank or the LIFESEQ" database (Incyte, 
Palo Alto CA) . This analysis is much faster than multiple, membrane- 
based hybridizations. In addition, the sensitivity of the computer 
search can be modified to determine whether any particular match is 
categorized as exact or homologous. 

The basis of the search is the product score which is defined as: 

^ sequgnce identity x \ maximum BLAST score 

100 

and it takes into acccount both the degree of similarity between two 
sequences and the length of the sequence match. For example, with a 
product score of 40, the match will be exact within a 1-2% error; and at 
70, the match will be exact. Homologous molecules are usually identified 
by selecting those which show product scores between 15 and 40, although 
lower scores may identify related molecules. 

V Extension of NSPLP-Encodinq pQlvnucleotidofl to Full La noth or to 

Full length NSPLP-encoding nucleic acid sequence (SEQ ID NO: 2) is 
used to design oligonucleotide primers for extending a partial nucleotide 
sequence to full length or for obtaining 5' sequences from genomic 
libraries. One primer is synthesized to initiate extension in the 
antisense direction (XLR) and the other is synthesized to extend sequence 
in the sense direction (XLF) . Primers allow the extension of the known 
NSPLP-encoding sequence "outward" generating amplicons containing new, 
unknown nucleotide sequence for the region of interest (US Patent 
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Applicarion 08/487, 112, filed June *? , 1995, specifically incorporated by 
reference) . The initial primers are designed from the cDNA using OLIGO* 
4.06 Primer Analysis Software (National Biosciences), or another 
appropriate program, to be 22-30 nucleotides in length, to have a GC 
content of 50% or more, and to anneal to the target sequence at 
temperatures about 68**-72° C. Any stretch of nucleotides which would 
result in hairpin structures and primer-primer dimerizations is avoided. 

The original, selected cDNA libraries, or a human genomic library 
are used to extend the sequence; the latter is most useful to obtain 5* 
upstream regions. If more extension is necessary or desired, additional 
sets of primers are designed to further extend the known region. 

By following the instructions for the XL- PGR kit (Per kin Elmer) and 
thoroughly mixing the enzyme and reaction mix, high fidelity 
amplification is obtained. Beginning with 40 pmoi of each primer and the 
recommended concentrations of all other components of the kit, PGR is 
performed using the Peltier Thermal Cycler (PTC200; MJ Research, 
Watertown MA) and the following parameters: 



Step 


1 


94** C for 1 min (initial denacuration) 


Step 


2 


65' C for 1 rain 


Step 


3 


68" C for 6 min 


Step 


4 


94** C for 15 sec 


Step 


5 


65** G for 1 min 


Step 


6 


68** C for 7 min 


Step 


7 


Repeat step 4-6 for 15 additional cycles 


Step 


8 


9,4' C for 15 sec 


Step 


9 


65' C for 1 min 


Step 


10 


68* C for 7:15 min 


Step 


11 


Repeat step 8-10 for 12 cycles 


Step 


12 


72** C for 8 min 


Step 


13 


4' C (and holding) 



A 5-10 m1 aliquot of the reaction mixture is analyzed by 
electrophoresis on a low concentration (about 0.6-0.8%! agarose mini-gel 
to determine which reactions were successful in extending the sequence. 
Bands thought to contain the largest products were selected and cut out 
of the gel. Further purification involves using a commercial gel 
extraction method such as QIAQuick^ (QIAGEN Inc) . After recovery of the 
DNA, Klenow enzyme was used to trim single-stranded, nucleotide overhangs 
creating blunt ends which facilitate religation and cloning. 

After ethanol precipitation, the products are redissolved in 13 fxX 
of ligation buffer, l^^l T4-DNA ligase (15 units) and 1^1 T4 
polynucleotide kinase are added, and the mixture is incubated at room 
temperature for 2-3 hours or overnight at 16° C, Competent coli cells 
(in 40 /il of appropriate media) are transformed with 3 /il of ligation 
mixture and cultured in 80 /ii of SOC medium (Sambrook J et al, supra) . 
After incubation for one hour at 37* C, the whole transformation mixture 
is plated on Luria Bertani (LB) -agar (Sambrook J et al, supra) containing 
2xGarb. The following day, several colonies are randomly picked from 
each plate and cultured in 150 of liquid LB/2xCarb medium placed in an 
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individual well of an appropriate, commercially-available, sterile 96- 
well microtiter plate. The following day, 5 m1 of each overnight culture 
is transferred into a non-sterile 96-weil plate and after dilution 1:10 
with water, 5 ^1 of each sample is transferred into a PCR array. 
5 For PCR amplification, 18 ^1 of concentrated PCR reaction mix 

(3.3x) containing 4 units of rTth DNA polymerase, a vector primer and one 
or both of the gene specific primers used for the extension reaction are 
added to each well. Amplification is performed using the following 
conditions: 

10 Step 1 94^ C for 60 sec 

Step 2 94' C for 20 sec 

Step 3 55° C for 30 sec 

Step 4 72" C for 90 sec 

Step 5 Repeat steps 2-4 for an additional 29 cycles 

15 Step 6 72" C for 180 sec 

Step 7 4° C (and holding) 

Aliquots of the PCR reactions are run on agarose gels ' together 
with molecular weight markers. The sizes of the PCR products are 
compared to the original partial cDNAs, and appropriate clones are 

20 selected, ligated into piasmid and sequenced. 

VI LabirliPT of Hybridization Probes 

Hybridization probes derived from SEQ ID NO: 2 are employed to 
screen cDNAs, genomic DNAs or mRNAs . Although the labeling of 
oligonucleotides, consisting of about 20 base-pairs, is specifically 

25 described, essentially the same procedure is used with larger cDNA 

fragments. Oligonucleotides are designed using state-of-the-art software 
such as OLIGO 4.06 (National Biosciences), labeled by combining 50 pmol 
of each oligomer and 250 mCi of (y-"P] adenosine triphosphate (Amersham, 
Chicago ID and T4 polynucleotide kinase (DuPont NEN*, Boston MA) . The 

30 labeled oligonucleotides are substantially purified with Sephadex G-25 

super fine resin column (Pharmacia) . A portion containing 10' counts per 
minute of each of the sense and antisense oligonucleotides is used in a 
typical membrane based hybridization analysis of human genomic DNA 
digested with one of the following endonucleases (Ase I, Bgl II, Eco RI , 

35 Pst I, Xba 1, or Pvu II; DuPont NEN*) . 

The DNA from each digest is fractionated on a 0.7 percent agarose 
gel and transferred to nylon membranes (Nytran Plus, Schleicher & 
Schuell, Durham NH) . Hybridization is carried out for 16 hours at 40*C. 
To remove nonspecific signals, blots are sequentially washed at room 

40 temperature under increasingly stringent conditions up to 0. 1 x saline 

sodium citrate and 0.5% sodium dodecyl sulfate. After XOMAT AR** film 
(Kodak, Rochester NY) is exposed to the blots in a Phosphoimager cassette 
(Molecular Dynamics, Sunnyvale CA) for several hours, hybridization 
patterns are compared visually. 

45 

VXI Antisense Moleculea 
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The NSPLP-encoding sequence, or any part thereof, is used to 
inhibit in vivo or XR vitro expression of naturally occurring NSPLP. 
Although use of antisense oligonucleotides, comprising about 20 base- 
pairs, is specifically described, essentially the same procedure is used 
with larger cDNA fragments. An oligonucleotide based on the coding 
sequences of NSPLP, as shown in Figs. lA, IB, 2A, and 2B is used to 
inhibit expression of naturally occurring NSPLP. The complementary 
oligonucleotide is designed from the most unique 5' sequence as shown in 
Figures lA, IB, 2A, and 2B and used either to inhibit transcription by 
preventing promoter binding to the upstream nontransiated sequence or 
translation of an NSPLP-encoding transcript by preventing the r-ibosome 
from binding. Using an appropriate portion of the leader and 5' sequence 
of SEQ ID NO: 2, an effective antisense oligonucleotide includes any 15-20 
nucleotides spanning the region which translates into the signal or early 
coding sequence of the polypeptide as shown in Figures lA, IB, 2A, and 
2B. 

VXXI Exnraaaion of NSPLP 

Expression of the NSPLP is accomplished by subcioning the cDNAs 
into appropriate vectors and transfecting the vectors into host cells. 
In this case, the cloning vector, pSport, previously used for the 
generation of the cDNA library is used to express NSPLP in £. coli . 
Upstream of the cloning site, this vector contains a promoter for 
ft-galactosidase, followed by sequence containing the amino-terminal Met 
and the subsequent 7 residues of fi-galactosidase . Immediately following 
these eight residues is a bacteriophage promoter useful for transcription 
and a linker containing a number of unique restriction sites. 

I.'iduction of an isolated, transfected bacterial strain with IPTG 
using standard methods produces a fusion protein which consists of the 
firsL seven residues of Ji-galactosidase, about 5 to 15 residues of 
linker, and the full length NSPLP-encoding sequence. The signal sequence 
directs the secretion of NSPLP into the bacterial growth media which can 
be used directly in the following assay for activity. 

IX NSPLP Aefcivitv 

NSPLP' s ER targeting activity can be assessed by a method of van 
de Velde et al (1994, supra). Microsomes are collected from cells 
expressing NSPLP by a 100,000 g spin in a method described by Verboomen H 
et al (1992 Biochem J 286:591-596). After treatment with 0.5 M KCl and 
centrif ugation the pellet is resuspended and subject to gel 
electrophoresis. Western blot analysis using antibodies to NSPLP reveals 
the presence of NSPLP in the ER membrane. 



X Production of MSPLP Spacifie Antlbodlaa 
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NSPLP substantially purified using PAGE electrophoresis (Sambrook, 
supra) is used to immunize rabbits and to produce antibodies using 
standard protocols. The amino acid sequence translated from NSPLP is 
analyzed using DNAStar software (DNAStar Inc) to determine regions of 
high immunogenicity and a corresponding oligopolypeptide is synthesized 
and used to raise antibodies by means icnown to those of skill in the art. 
Analysis to select appropriate epitopes, such as those near the 
C-terminus or in hydrophilic regions (shown in Figs. 7 and 8) is 
described by Ausubel FM et al (supra) . 

Typically, the oligopeptides are 15 residues in length, 
synthesized using an Applied Biosystems Peptide Synthesizer Model 431A 
using fmoc-chemistry, and coupled to keyhole limpet hemocyanin (KLH, 
Sigma) by reaction with M-maleimidobenzoyl-N-hydroxysuccinimide ester 
(MBS; Ausubel FM et al, supra) . Rabbits are immunized with the 
oligopeptide-KLH complex in complete Freund's adjuvant. The resulting 
antisera are tested for antipeptide activity, for example, by binding the 
peptide to plastic, blocking with 1% BSA, reacting with rabbit antisera, 
washing, and reacting with radioiodinated, goat anti-rabbit IgG. 

XI Purlfical^lon of Naturally Qgeurrina NSPLP Using Specific 

Naturally occurring or recombinant NSPLP is substantially purified 
by immunoaf f inity chromatography using antibodies specific for NSPLP. An 
immunoaf finity column is constructed by covaiently coupling NSPLP 
antibody to an activated chromatographic resin such as CnBr-activated 
Sepharose (Pharmacia Biotech). After the coupling, the resin is blocked 
and washed according to the manufacturer's instructions. 

Media containing NSPLP is passed over the immunoaf finity column, 
and the column is washed under conditions that allow the preferential 
absorbance of NSPLP (eg. high ionic strength buffers in the presence of 
detergent) . The column is eluted under conditions that disrupt 
antibody/NSPLP binding (eg, a buffer of pH 2-3 or a high concentration of 
a chaotrope such as urea or thiocyanate ion), and NSPLP is collected. 

XII Idantification of Molttculea Which Interact with HSPLP 

NSPLP, or biologically active fragments thereof, are labelled with 
Bolton-Hunter reagent (Bolton, AE and Hunter, WM (1973) Biochem J 133: 
529) . Candidate molecules' previously arrayed in the wells of a 96 well 
plate are incubated with the labelled NSPLP, washed and any wells with 
labelled NSPLP complex are assayed. Data obtained using different 
concentrations of NSPLP are used to calculate values for the number, 
affinity, and association of NSPLP with the candidate molecules. 

All publications and patents mentioned in the above specification 
are herein incorporated by reference. Various modifications and 
variations of the described method and system of the invention will be 
apparent to those skilled in the art without departing from the scope and 
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spirit of the invention. Although the invention has been described in 
connection with specific preferred embodiments, it should be understood 
that the invention as claimed should not be unduly limited to such 
specific embodiments. Indeed, various modifications of the described 
5 modes for carrying out the invention which are obvious to those skilled 

in molecular biology or related fields are intended to be within the 
scope of the following claims. 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION 

(i) APPLICANT: INCYTE PHARMACEUTICALS, INC. 

(ii) TITLE OF THE INVENTION: TWO NOVEL HUMAN NSP-LIKE PROTEINS 

(iii) NUMBER OF SEQUENCES: 9 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Incyte Pharmaceuticals, Inc. 

(B) STREET: 3174 Porter Drive 

(C) CITY: Palo Alto 

(D) STATE: CA 

(E) COUNTRY: U.S. 

(F) ZIP: 94304 

(V) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette 

(B) COMPUTER: IBM Compatible 

(C) OPERATING SYSTEM: DOS 

(D) SOFTWARE: FastSEQ Version 1.5 

(vi) CURRENT APPLICATION DATA: 

(A) PCT APPLICATION NUMBER: To Be Assigned 

(B) FILING DATE: Filed Herewith 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/700,607 

(B) FILING DATE: AUGUST 12, 1996 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Billings, Lucy J. 

(B) REGISTRATION NUMBER: 36,749 

(C) REFERENCE/DOCKET NUMBER: PF-0114 PCT 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 415-855-0555 

(B) TELEFAX; 415-845-4166 
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(2) INFORMATION FOR SEQ ID N0:1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 199 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: 

(B) CLONE: Consensus 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:1: 



Met 


Asp 


Gly 


Gin 


Lys 


Lys 


Asn 


Trp 


Lys 


Asp 


Lys 


Val 


Val 


Asp 


Leu 


Leu 


1 








5 










10 










15 




Tyr 


Trp Arg 


Asp 


He 


Lys 


Lys 


Thr 


Gly Val Val 


Phe Gly Ala 


Ser 


Leu 








20 










25 










30 






Phe 


Leu 


Leu 
35 


Leu 


Ser 


Leu 


Thr 


Val 
40 


Phe 


Ser 


He 


Val 


Ser 
45 


Val 


Thr 


Ala 


Tyr 


He 
50 


Ala 


Leu 


Ala 


Leu 


Leu 
55 


Ser 


Val 


Thr 


He 


Ser 
60 


Phe 


Arg 


He 


Tyr 


Lys 


Gly 


Val 


He 


Gin 


Ala 


He 


Gin 


Lys 


Ser 


Asp 


Giu 


Gly 


His 


Pro 


Phe 


65 










70 










75 










80 


Arg Ala 


Tyr 


Leu 


Glu 


Ser 


Glu 


Val 


Ala 


He 


Ser 


Glu 


Glu 


Leu 


Val 


Gin 








85 










90 










95 




Lys 


Tyr 


Ser 


Asn 
100 


Ser 


Ala 


Leu 


Gly 


His 
105 


Val 


Asn 


Cys 


Thr 


He 
110 


Lys 


Glu 


Leu Arg 


Arg 


Leu 


Phe 


Leu 


Val 


Asp 


Asp 


Leu 


Val 


Asp 


Ser 


Leu 


Lys 


Phe 






115 










120 










125 








Ala 


Vai 


Leu 


Met 


Trp 


Val 


Phe 


Thr 


Tyr 


Val 


Gly 


Ala 


Leu 


Phe 


Asn Gly 




130 








135 










140 










Leu 


Thr 


Leu 


Leu 


lie 


Leu 


Ala 


Leu 


He 


Ser 


Leu 


Phe 


Ser 


Val 


Pro 


Val 


145 










150 










155 










160 


He 


Tyr 


Glu 


Arg 


His 
165 


Gin 


Ala 


Gin 


He 


Asp 

no 


His 


Tyr 


Leu 


Gly 


Leu 

175 


Ala 


Asn 


Lys 


Asp. 


Val 
180 


Lys 


Asp 


Ala 


Met 


Ala 
185 


Lys 


He 


Gin 


Ala 


Lys 
190 


He 


Pro 


Gly 


Leu 


Lys 
195 


Arg 


Lys 


Ala 


Glu 





















(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 799 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single* 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: 

(B) CLONE: Consensus 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

GGTTTGTGCA GTTACAGCTT TTCTNTTGGT ATGCATAATT AATANTTGGA GCTGCAAAGA 60 
GATCGTGACA AGAGATGGAC GGTCAGAAGA AAAATTGGAA GGACAAGGTT GTTGACCTCC 120 
TGTACTGGAG AGACATTAAG AAGACTGGAG TGGTGTTTGG TGCCAGCCTA TTCCTGCTGC 180 
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TTTCATTGAC AGTATTCAGC ATTGTGAGCG TAACAGCCTA CATTGCCTTG GCCCTGCTCT 240 

CTGTGACCAT CAGCTTTAGG ATATACAAGG GTGTGATCCA AGCTATCCAG AAATCAGATG 300 

AAGGCCACCC ATTCAGGGCA TATCTGGAAT CTGAAGTTGC TATATCTGAG GAGTTGGTTC 360 

AGAAGTACAG TAATTCTGCT CTTGGTCATG TGAACTGCAC GATAAAGGAA CTCAGGCGCC 420 

TCTTCTTAGT TGATGATTTA GTTGATTCTC TGAAGTTTGC AGTGTTGATG TGGGTATTTA 480 

CCTATGTTGG TGCCTT6TTT AATGGTCTGA CACTACTGAT TTTGGCTCTC ATTTCACTCT 540 

TCAGTGTTCC TGTTATTTAT GAACGGCATC AGGCACAGAT AGATCATTAT CTAGGACTTG 600 

CAAATAAGAA TGTTAAAGAT GCTATGGCTA AAATCCAAGC AAAAATCCCT GGATTGAAGC 660 

GCAAAGCTGA ATGAAAACGC CCAAAATAAT TAGTAGGAGT TCATCTTTAA AGGGGATATT 720 

CATTTGATTA TACGGGGGAG GGTCAGGGAA GAACGACCTT GACGTTGCAG TGCAGTTTCA 780 

CAGATCGTTG TTAGATCTT 799 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 241 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: THPlNOBOl 

(B) CLONE: 31870 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 



Met 


Ala 


Giu 


Arg 


Xaa 


Ala 


Ala 


Thr 


Gin Ser 


His 


Ser 


He 


Ser 


Ser 


Ser 


1 






5 








10 










15 




Ser 


Phe 


Gly 


Ala 


Glu 


Pro 


Ser 


Ala 


Pro Gly 


Gly 


Gly Gly 


Ser 


Pro 


Gly 






20 










25 








30 






Ala 


Cys 


Pro 


Ala 


Leu 


Gly 


Thr 


Lys 


Ser Cys 


Ser 


Ser 


Ser 


Cys 


Ala 


Val 




35 










40 








45 








His 


ASD 


Leu 


lie 


Xaa 


Trp 


Arg 


Asp 


Val Lys 


Lys 


Thr 


Gly 


Phe 


Val 


Phe 




50* 










55 








60 










Gly Thr 


Thr 


Leu 


He 


Met 


Leu 


Leu 


Ser Leu 


Ala 


Ala 


Phe 


Ser 


Vai 


He 


65 










70 








75 










60 


Ser 


Val 


Val 


Ser 


Tyr 


Leu 


He 


Leu 


Ala Leu 


Leu 


Ser 


Val 


Thr 


He 


Ser 










85 








90 










95 




Phe 


Arg 


lie 


Tyr 


Lys 


Ser 


Val 


He 


Gin Ala 


Val 


Gin 


Lys 


Ser 


Glu 


Glu 






100 








105 








110 






Gly His 


Pro 


Phe 


Lys 


Ala 


Tyr 


Leu Asp Val 


Asp 


He 


Thr 


Leu 


Ser 


Ser 






115 








120 








125 








Giu 


Ala 


Phe 


His 


Asn 


Tyr 


Met 


Asn 


Ala Ala 


Met 


Val 


His 


He 


Asn 


Arg 




130 










135 








140 










Ala 


Leu 


Lys 


Leu 


He 


He 


Arg 


Leu 


Phe Leu 


Val 


Glu Asp 


Leu 


Val 


Asp 


145 








150 








155 










160 


Ser 


Leu 


Lys 


Leu 


Ala 


Val 


Phe 


Met Trp Leu 


Met 


Thr 


Tyr 


Val 


Gly Ala 








165 








170 










175 




Val 


Phe 


Asn 


Gly 


He 


Thr 


Leu 


Leu 


He Leu 


Ala 


Glu 


Leu 


Leu 


He 


Xaa 








180 










185 








190 






Ser 


Val 


Pro 


lie 


Val 


Tyr 


Xaa 


Lys 


Tyr Lys 


Val 


Pro 


Ser 


Lys 


Thr 


Pro 






195 










200 








205 








Trp 


Asn 


Arg 


Gin 


Lys 


Lys 


Gly 


Arg 


He Ser 


Thr 


Trp 


Lys 


Pro 


Glu 


Met 




210 










215 








220 










Gin 


Gin 


Leu 


Leu 


Lys 


His 


His 


Leu 


He Val 


He 


Thr 


Ser 


Leu 


Leu 


Val 


225 








230 








235 










240 



Leu 



(2) INFORMATION FOR SEQ ID NO: 4: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1095 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: THPlNOBOl 

(B) CLONE: 31870 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

ACACNAGCGN NTCGNGCTCC CGAACCTCTA GCTGCGACTC GGANTGAGTC AGTCAGTCTG 60 

TCGGAGTCTG TCCTCGGAGC AGGCGGAGTA AAGGGACTTG AGCGAGCCAG TTGCCGGATT 120 

ATTCTATTTC CCCTCCCTCT CTCCCGCCCC GTATCTCTTT TCATTTTNNT NCCACCCTTG 180 

CTCGCGTANC ATGGCGGAGC GTNCGGCGGC CACTCAGTCC CATTCCf^TCT CCTCGTCGTC 240 

CTTCGGAGCC GAGCCGTCCG CGCCCGGCGG CGGCGGGAGC CCAGGAGCCT GCCCCGCCCT 300 

GGGGACGAAG AGCTGCAGCT CCTCCTGTGC GGTGCACGAT CTGATTTTMT GGAGAGATGT 360 

GAAGAAGACT GGGTTTGTCT TTGGCACCAC GCTGATCATG CTGCTTTCCC TGGCAGCTTT 4 20 

CAGTGTCATC AGTGTGGTTT CTTACCTCAT CCTGGCTCTT CTCTCTGTCA CCATCAGCTT 480 

CAGGATCTAC AAGTCCGTCA TCCAAGCTGT ACAGAAGTCA GAAGAAGGCC ATCCATTCAA 540 
AGCCTACCTG GACGTAGACA TTACTCTGTC CTCAGAAGCT TTCCATAATT ACATGAATGC 600 
TGCCATGGTG CACATCAACA GGGCCCTGAA ACTCATTATT CGTCTCTTTC TGGTAGAAGA 660 
TCTGGTTGAC TCCTTGAAGC TGGCTGTCTT CATGTGGCTG ATGACCTATG TTGGTGCTGT 720 
TTTTAACGGA ATCACCCTTC TAATTCTTGC TGAACTGCTC ATTTTNAGTG TCCCGATTGT 780 
NTATNAGAAG TACAAGGTTC CAAGCAAAAC TCCCTGGAAT CGCCAAAAAA AAGGCAGAAT 84 0 
AAGTACATGG AAACCAGAAA TGCAACAGTT ACTAAAACAC CATTTAATAG TTATAACGTC 900 
GTTACTTGTA CTATGAAGGA AAATACTCAG TGTCAGCTTG AGCCTGCATT CCAAGCTTTT 960 

TTTTTAATTT GGTGGTTTTC TCCCATCCTT TCCCTTTAAC CCTCAGTNTC AAGCACAAAN 1020 

TTTNATGGAC TGATAANNGA TCTATNTTAG ANCTCAGAAG ANGANAGNTT CANNTGCATA 1080 

GGNTAAGGNA NTACC 1095 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 77 6 amino acids 

(B) TYPE: amino acid 

iC) STRANDEDNESS: single 
ID) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: GenBank 

(B) CLONE: 307307 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 



Met 


Ala 


Ala 


Pro 


Gly 


Asp 


Pro 


Gin 


Asp 


Glu 


Leu Leu Pro Leu Ala Gly 


1 








5 










10 




15 




Pro 


Giy Ser 


Gin 


Trp 


Leu Arg 


His 


Arg 


Gly 


Glu Gly Glu Asn 


Glu 


Ala 








20 










25 




30 






Val 


Thr 


Pro 


Lys 


Gly 


Ala 


Thr 


Pro 


Ala 


Pro 


Gin Ala Gly Glu 


Pro 


Ser 






35 








40 






45 






Pro 


Gly 
50 


Leu 


Gly 


Ala 


Arg 


Ala 
55 


Arg 


Glu 


Ala 


Ala Ser Arg Glu 
60 


Ala 


Gly 


Ser 


Gly 


Pro 


Ala 


Arg 


Gin 


Ser 


Pro 


Val 


Ala 


Met Glu Thr Ala 


Ser 


Thr 


65 










70 










75 




80 


Gly 


Val 


Ala 


Gly 


Val 
85 


Ser 


Ser 


Ala 


Met 


Asp 
90 


His Thr Phe Ser 


Thr 
95 


Thr 
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Ser Lys Asp Giy Glu Gly Ser Cys Tyr Thr Ser Leu lie Ser Asp lie 

100 105 110 

Cys Tyr Pro Pro Gin Glu Asp Ser Thr Tyr Phe Thr Gly lie Leu Gin 

115 120 125 

Lys Glu Asn Gly His Val Thr He Ser Glu Ser Pro Glu Glu Leu Gly 

130 135 140 

Thr Pro Gly Pro Ser Leu Pro Asd Val Pro Gly lie Glu Ser Arg Gly 
145 150 " 155 160 

Leu Phe Ser Ser Asp Ser Gly lie Glu Met Thr Pro Ala Glu Ser Thr 

165 110 175 

Glu Val Asn Lys lie Leu Ala Asp Pro Leu Asp Gin Met Lys Ala Glu 

180 185 190 

Ala Tyr Lys Tyr He Aso He Thr Arg Pro Glu Glu Val Lys His Gin 

195 ' 200 205 

Glu Gin His His Pro Glu Leu Glu Asp Lys Asp Leu Asp Phe Lys Asn 

210 215 220 

Lys Aso Thr Asp He Ser lie Lys Pro Glu Gly Val Arg Glu Pro Asp 
225 * ' 230 235 240 

Lys Pro Ala Pro Val Glu Giy Lys He He Lys Asp "His Leu Leu Glu 

245 250 255 

Glu Ser Thr Phe Ala Pro Tyr He Asp Asp Leu Ser Glu Glu Gin Arg 

260 265 270 

Arg Ala Pro Gin He Thr Thr Pro Val Lys lie Thr Leu Thr Glu He 

275 280 285 

Glu Pro Ser Val Glu Thr Thr Thr Gin Glu Lys Thr Pro Glu Lys Gin 

290 295 300 

Asp He Cys Leu Lys Pro Ser Pro Asp Thr Val Pro Thr Val Thr Val 
305 310 315 320 

Ser Glu Pro Glu Aso Asp Ser Pro Gly Ser He Thr Pro Pro Ser Ser 

325 330 335 

Gly Thr Glu Pro Ser Ala Ala Glu Ser Gin Gly Lys Gly Ser He Ser 

340 345 350 

Glu Asp Glu Leu He Thr Ala He Lys Glu Ala Lys Giy Leu Ser Tyr 

355 360 365 

Glu Thr Ala Glu Asn Pro Arg Pro Val Gly Gin Leu Ala Asp Arg Pro 

370 375 380 

Glu Val Lys Ala Arg Ser Giy Pro Pro Thr He Pro Ser Pro Leu Asp 
385 390 395 400 

His Giu Ala Ser Ser Ala Glu Ser Gly Aso Ser Glu He Glu Leu Val 

405 410 415 

Ser Glu Aso Pro Met Ala Ala Giu Asp Ala Leu Pro Ser Gly Tyr Val 

420 425 430 

Ser Phe Gly His Val Gly Giy Pro Pro Pro Ser Pro Ala Ser Pro Ser 

435 440 445 

He Gin Tyr Ser He Leu Arg Glu Glu Arg Glu Ala Glu Leu Asp Ser 

450 455 460 

Glu Leu He He Glu Ser Cys Asp Ala Ser Ser Ala Ser Giu Glu Ser 
465 470 475 480 

Pro Lys Arg Glu Gin Asp Ser Prg Pro Met Lys Pro Ser Ala Leu Asp 

485 490 495 

Ala He Arg Glu Glu Thr Gly Val Arg Ala Glu Giu Arg Ala Pro Ser 

500 505 510 

Arg Arg Gly Leu Ala Glu Pro Gly Ser Phe Leu Asp Tyr Pro Ser Thr 

515 520 525 

Giu Pro Gin Pro Giy Pro Giu Leu Pro Pro Gly Asp Gly Ala Leu Giu 

530 535 540 

Pro Glu Thr Pro Met Leu Pro Arg Lys Pro Glu Glu Asp Ser Ser Ser 
545 550 555 560 

Asn Gin Ser Pro Ala Ala Thr Lys Giy Pro Giy Pro Leu Giy Pro Gly 

565 570 575 

Ala Pro Pro Pro Leu Leu Phe Leu Asn Lys Gin Lys Ala He Asp Leu 
580 585 590 
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Leu 


Tyr Tro 


Arg 




lie 


Lvs 


Gin 




595 










600 


Leu 


Leu Leu 


Leu 


Phe 


Ser 


Leu 


Thr 




610 








615 




Ala 


Tyr Leu 


Ala 


Leu 


Ala 


Ala 


Leu 


625 






630 






Tyr 


Lys Ser 


Val 




Gin 


Ala 


Val 






645 








Phe Lys Ala Tyr 




Glu 


Leu 


Glu 






660 










Gin 


Lys Tyr 


Thr 


Asp 






Gin 




675 










680 


Glu 


Leu Arg 


Arg 


Leu 


rue 


L^eu 


vox 




690 






695 




Phe 


Ala Val 


Leu 


Met 


Trp 


Leu 


Leu 


705 








710 






Gly 


Leu Thr 


Leu 


Leu 


Leu 


Met 


Ala 






725 








Val 


Val Tyr 


Val 


Lys 


His 


Gin 


Ala 






740 










Val 


Arg Thr 


His 


lie 


Asn 


Ala 


Val 




755 










7 60 


Pro 


Gly Ala 


Lys 


Arg 


His 


Ala 


Glu 




770 








775 





Thr 


Gly lie 


Val 


Phe 
605 


Glv 


Ser 


Phe 


Gin 


Phe Ser 


Val 
620 


Val 


Ser 


Val 


Val 


Ser 


Ala Thr 
635 


lie 


Ser 


Phe 


Arg 


lie 
640 


Gin 




Asp Glu 




His 


Pro 




650 








655 




Tip 
X X t= 


1 1 11. u 


Ser 


Gin 




Gin 


lie 


665 








670 






IT ne 


1 y r V ax 


Asn 


Ser 
685 


THt- 

A nr 


Leu 




Gin 


Asp Leu 


Val 
700 


Asp 


Ser 


Leu 


Lys 


Thr 


Tyr Val 


Gly Ala 


Leu 


Phe 


Asn 




715 










720 


Val 


Val Ser 
730 


Met 


Phe 


Thr 


Leu 
735 


Pro 


Gin 


lie Asp 


Gin 


Tyr 


Leu 


Gly 


Leu 


745 








750 






Val 


Ala Lys 


lie 


Gin 
765 


Ala 


Lys 


lie 



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 356 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: GenBank 

(B) CLONE; 307309 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Met Ala Ala Glu Asp Ala Leu Pro Ser Gly Tyr Val Ser Phe Gly His 

15 10 15 

Val Gly Gly Pro Pro Pro Ser Pro Ala Ser Pro Ser lie Gin Tyr Ser 

20 25 30 

lie Leu Arg Glu Glu Arg Glu Ala Glu Leu Asp Ser Glu Leu lie lie 

35 40 45 

Glu Ser Cys Asp Ala Ser Ser Ala Ser Glu Glu Ser Pro Lys Arg Glu 

50 55 - 60 

Gin Asp Ser Pro Pro Met Lys Pro Ser Ala Leu Asp Ala lie Arg Glu 
65 70 75 80 

Glu Thr Gly Val Arg Ala Glu Glu Arg Ala Pro Ser Arg Arg Gly Leu 

85 90 95 

Ala Glu Pro Gly Ser Phe Leu Asp Tyr Pro Ser Thr Glu Pro Gin Pro 

100 105 110 

Gly Pro Glu Leu Pro Pro Gly Asp Gly Ala Leu Glu Pro Glu Thr Pro 

115 120 125 

Met Leu Pro Arg Lys Pro Glu Glu Asp Ser Ser Ser Asn Gin Ser Pro 

130 135 140 

Ala Ala Thr Lys Gly Pro Gly Pro Leu Gly Pro Gly Ala Pro Pro Pro 
145 150 155 160 

Leu Leu Phe Leu Asn Lys Gin Lys Ala lie Asp Leu Leu Tyr Trp Arg 
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165 170 175 



Asp 


He 


Lys Gin Thr Giy 


He Val Phe Gly 


ber . 


Phe 


Leu 


Leu 


Leu 


Leu 






180 




185 










1 on 






Phe 


Ser 


Leu 


Thr 


Gin Phe 


Ser Val Val 


Ser 


vai 


Val 


Ala 


Tyr 


Leu 


Axa 






195 






200 








205 








Leu 


Ala 


Ala 


Leu 


Ser Ala 


Thr He Ser 


Phe 


Arg 


He Tyr 


Lys 


Ser 


Val 




210 








215 






220 










Leu 


Gin 


Ala 


Val 


Gin Lys 


Thr Asp Glu 


Gly 


His 


Pro 


Phe 


Lys 


Ala 


Tyr 


225 








230 






235 










240 


Leu 


Giu 


Leu 


Glu 


lie Thr 


Leu Ser Gin 


Glu 


Gin 


He 


Gin 


Lys 


Tyr 


Thr 










245 




250 










255 




Asp Cys 


Leu 


Gin 


Phe Tyr 


Val Asn Ser 


Thr 


Leu 


Lys 


Glu 


Leu 


Arg 


Arg 








260 


265 










270 






Leu 


Phe 


Leu 


Val 


Gin Asp 


Leu Val Asp Ser 


Leu 


Lys 


Phe 


Ala 


Val 


Leu 






275 




260 








265 








Met 


Trp 


Leu 


Leu 


Thr Tyr 


Val Gly Ala 


Leu 


Phe Asn Gly Leu Thr Leu 




290 






295 






300 










Leu 


Leu 


Met 


Ala 


Val Val 


Ser Met Phe 


Thr 


Leu 


Pro 


Val 


Val 


Tyr 


Val 


305 








310 






315 










320 


Lys 


His 


Gin 


Ala 


Gin He 


Asp Gin Tyr 


Leu 


Gly 


Leu 


Val 


Arg 


Thr 


His 








325 




330 










335 




He 


Asn 


Ala 


Val 


Val Ala 


Lys lie Gin 


Ala 


Lys 


He 


Pro 


Gly 


Ala 


Lys 








340 




345 










350 






Arg 


His 


Ala 


Glu 





















355 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 208 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: GenBank 

(B) CLONE: 307311 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:7: 



Met 


Gin 


Ala 


Thr 


Ala 


Asp 


Ser 


Thr 


Lys 


Met 


Asp 


Cys 


Val 


Trp 


Ser 


Asn 


1 








5 








10 










15 




Trp 


Lys 


Ser 


Gin 


Ala 


He 


Asp 


Leu 


Leu 


Tyr Trp Arg Asp 


lie 


Lys Gin 






20 










25 










30 






Thr 


Gly 


He 


Val 


Phe 


Gly 


Ser 


Phe 


Leu 


Leu 


Leu 


Leu 


Phe 


Ser 


Leu 


Thr 




35 








40 










45 








Gin 


Phe 


Ser 


Val 


Val 


Ser 


Val 


Val 


Ala 


Tyr 


Leu 


Ala 


Leu 


Ala 


Ala 


Leu 




50 










55 










60 










Ser 


Ala 


Thr 


He 


Ser 


Phe 


Arg 


He 


Tyr 


Lys 


Ser 


Val 


Leu 


Gin 


Ala 


Val 


65 










70 










75 










80 


Gin 


Lys 


Thr 


Asp 


Glu 


Gly 


His 


Pro 


Phe 


Lys 


Ala 


Tyr 


Leu 


Glu 


Leu 


Glu 










85 










90 










95 




He 


Thr 


Leu 


Ser 


Gin 


Glu 


Gin 


He 


Gin 


Lys Tyr Thr Asp Cys 


Leu 


Gin 








100 










105 










110 






Phe 


Tyr 


Val 


Asn 


Ser 


Thr 


Leu 


Lys 


Glu 


Leu 


Arg 


Arg 


Leu 


Phe 


Leu 


Val 




115 










120 










125 








Gin 


Asp 


Leu 


Val 


Asp 


Ser 


Leu 


Lys 


Phe 


Ala 


Val 


Leu 


Met 


Trp 


Leu 


Leu 




130 










135 










140 










Thr 


Tyr 


Val 


Gly 


Ala 


Leu 


Phe 


Asn 


Gly 


Leu 


Thr 


Leu 


Leu 


Leu 


Met 


Ala 


145 






150 










155 










160 


Val 


Val 


Ser 


Met 


Phe 


Thr 


Leu 


Pro 


Val 


Val 


Tyr 


Val 


Lys 


His 


Gin 


Ala 
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165 170 175 

Gin lie Asp Gin Tyr Leu Gly Leu Val Arg Thr His lie Asn Ala Val 

180 18S 190 

Val Ala Lys He Gin Ala Lys He Pro Gly Ala Lys Arg His Ala Glu 

195 200 205 



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 267 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: GenBank 

(B) CLONE: 281046 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Met Asp Cys Val Tro Ser Asn Trp Lys Ser Gin Ala lie Asp Leu Leu 

1 5' 10 15 

Tyr Trp Arg Asp He Lys Gin Thr Gly He Val Phe Gly Ser Phe Leu 

20 25 30 

Leu Leu Leu Phe Ser Leu Thr Gin Phe Ser Val Val Ser Val Val Ala 

35 40 45 

Tyr Leu Ala Leu Ala Ala Leu Ser Ala Thr He Ser Phe Arg He Tyr 

50 55 60 

Lys Ser Val Leu Gin Ala Val Gin Lys Thr Asp Glu Gly His Pro Phe 
65 70 75 80 

Lys Ala Tyr Leu Glu Leu Glu He Thr Leu Ser Gin Glu Gin He Gin 

85 90 95 

Lys Tyr Thr Asp Cys Leu Gin Leu Tyr Val Asn Ser Thr Leu Lys Glu 

100 105 110 

Leu Arg Arg Leu Phe Leu Val Gin Asp Leu Val Asp Ser Leu Lys Phe 

115 120 125 

Ala Val Leu Met Tro Leu Leu Thr Tyr Val Gly Ala Leu Phe Asn Gly 

130 ' 135 140 

Leu Thr Leu Leu Leu Met Ala Val Val Ser Met Phe Thr Leu Pro Val 
145 150 155 160 

Val Tyr Val Lys His Gin Ala Gin Val Asp Gin Tyr Leu Gly Leu Val 

165 no 175 

Arg Thr His He Asn Thr Val Val Ala Lys He Gin Ala Lys He Pro 

180 185 190 

Gly Ala Arg Gly Met Leu Ser Arg Trp Leu Pro Gin Glu Lys Pro Asp 

195 200 205 

Met Asn Gly Gly Val Trp Ser Gly Asn Ser Ser Leu Leu Pro Arg Tyr 

210 215 . 220 

Cys Glu Leu He Val Ser Leu Pro Gin Tyr His Asn Leu Arg Gly Lys 
225 230 235 240 

Leu Arg Asp Arg Cys Phe Gin Ser Phe Pro Val Leu Leu Gly Tyr Leu 

245 250 255 

Ser Pro Pro Arg Pro Leu Ser Ser Thr Lys Val 
260 265 



(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 261 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: SPLNFETOl 

(B) CLONE: 28742 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

CCTATNCCNG CTGCTTTCAT TGACAGTATT CAGCATTGTG AGCGTAACAG CCTACATTGC 60 

CTTNGCCCTG CNCTCTGTGA CCATCAGCTN TAGGCTATAC AAGGGTGTGA TCCAAGCTAT 120 

CCAGAAATCA GATGAAGGNC ACCCATTCAG GGCATATCTG GANTCTGAAG TTGCTATATC ' 180 

TGAGGAGTTG NTTCAGAAGT ACACGTAAAT NNTGNNCNTG GTCAATGTGA NCTCCACGNC 240 

TAANGGANCT CAGGTGCCTA T 261 
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CIAIMS 

1. A substantially purified human NSPLP protein comprising the 
amino acid sequence of SEQ ID N0:1 or fragments thereof. 

2. An isolated and purified polynucleotide sequence encoding a 
protein of claim 1. 

3. An isolated and purified polynucleotide sequence of claim 2 
consisting of the sequence of SEQ ID NO: 2 or degenerate variants thereof. 

4. A polynucleotide sequence fully complementary to the sequence 
of SEQ ID NO: 2 or degenerate variants thereof. 

5. An isolated and purified polynucleotide sequence of claim 2 
consisting of a polynucleotide sequence that hybridizes under stringent 
hybridization conditions to the sequence of SEQ ID NO: 2. 

6. A recombinant expression vector containing a polynucleotide 

sequence of claim 2. 

7. A recombinant host cell comprising $ polynucleotide sequence of 

claim 2. 

8. A method for producing a polypeptide comprising the amino acid 
sequence shown in SEQ ID N0:1, the method comprising the steps of: 

a) culturing the host cell of Claim 7 under conditions 
suitable for the expression of the polypeptide; and 

b) recovering the polypeptide from the host cell culture. 

9. A recombinant expression vector containing a polynucleotide 
sequence of claim 5. 

10. A recombinant host cell comprising a polynucleotide sequence 

of claim 9. 

11. A pharmaceutical composition comprising a substantially 
purified human NSPLP protein (SEQ ID N0:1) in conjunction with a suitable 
pharmaceutical carrier. 

12. A purified antibody which binds specifically to a polypeptide 

of claim 1 . 

13. A purified antagonist which specifically blocks or reduces the 
activity of the polypeptide of claim 1. 

14. A pharmaceutical composition comprising a substantially 
purified antagonist of the polypeptide of claim 1 in conjunction with a 
suitable pharmaceutical carrier. 

15. A substantially purified human NSPLP protein comprising the 
amino acid sequence of SEQ ID NO: 3 or fragments thereof. 

16. An isolated and purified polynucleotide sequence encoding a 
protein of claim 15. 

17. An isolated and purified polynucleotide sequence of claim 16 
consisting of the sequence of SEQ ID NO: 4 or degenerate variants thereof. 

18. A polynucleotide sequence fully complementary to the sequence 
of SEQ ID NO: 4 or degenerate variants thereof. 
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19. An isolated and purified polynucleotide sequence of claim 16 
consisting of a polynucleotide sequence that hybridizes under stringent 
hybridization conditions to the sequence of SEQ ID NO: 4. 

20. A recombinant expression vector containing a polynucleotide 
5 sequence of claim 16. 

21. A recombinant host cell comprising a polynucleotide sequence 
of claim 16^. 

22. A method for producing a polypeptide comprising the amino acid 
sequence shown m SEQ ID NO: 3, the method comprising the steps of: 

10 a) culturing the host cell of Claim 21 under conditions 

suitable for the expression of the polypeptide; and 

b) recovering the polypeptide from the host cell culture. 

23. A recombinant expression vector containing a polynucleotide 
sequence of claim 19. 

15 24 . A recombinant host cell comprising a polynucleotide sequence 

of claim 23. 

25. A pharmaceutical composition comprising a substantially 
purified human NSPLP protein (SEQ ID NO: 3) in conjunction with a suitable 
pharmaceutical carrier. 
20 26. A purified antibody which binds specifically to a polypeptide 

of claim 15. 

27. A purified antagonist which specifically blocks or reduces the 
activity of the polypeptide of claim 15. 

28. A pharmaceutical composition comprising a substantially 

25 purified antagonist of the polypeptide of claim 15 in conjunction with a 

suitable pharmaceutical carrier. 
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Th« IntemaUonal Search Report has not been established in respect of certain ciaima under Arttde 17(2}(a) tor the toUowing reasons: 
1. I I QaimsNos.: 

— because they relate to subject matter not required to tie searched by this Authority, namely: 



2. Cliimt Nos.: 

because they relate to parts of the International AppUcaiton that do not comply with the prescribed requtrements to such 
an extent that no meaningful International Search can be earned out, speaftoally: 



3. I I Ctatms Nos.: 

because they are dependent claims and are not drafted in accordance with the second and thinj semertcea of Rule 6.4(a). 



Box II Observations where unity of Invention is lacking (Continuation of Item 2 of first sheet) 



This International Searching Authority found muttiple inventions in this international application, as foUowe: 

See annex 



1 , I I As all required additional search fees were timely paid by the applicant, this International Search Report covers all 
1—1 searchable claims. 



2, [ I As all aeafchable daims could be searched without effort fustifying an additional fee, this Authonty did not tnvite payment 
of any additional fee. 



3. rn As only some of the required additional search tees were timely paid by the applicant, this International Search Report 
*— * covers only those daims for which fees were paid, speafically claims Nos.; 



4. [X] No required additional search fees were timely paid toy the applicant. Consequently, this tntemational Search Report is 
restneted to the invention first menuoned in the daims; it is covered by daims Nos.: 

1-U 



Remark on Protest [ | The additional search fees were accompanied l>y the applicant's protest. 

I j No protest accompanied the payment of additionai search fees. 



Form PCT/tSA/21 0 (continuation of first sheet (1 }) (July 1992) 



Intemattonai Application No. PCT/ US 97/13469 



FURTHER INFORMATION CONTINUED FROM PCT/ISA/ 210 



1. Claims: 1-14 

Human NSPLP protein as in Seq.ID:! or fragments thereof. 
Corresponding polynucleotide sequence as in Seq.ID:2 and 
variants. Expression vector, transformed host cell and 
method for production of the polypeptide. Specific antibody 
and antagonist of the polypeptide. Pharmaceutical 
composition comprising the polypeptide or the antagonist. 



2. Claims: 15-28 

Human NSPLP protein as in Seq.ID:3 or fragments thereof. 
Corresponding polynucleotide sequence as in Seq-ID:4 and 
variants. Expression vector, transformed host cell and 
method for production of the polypeptide. Specific antibody 
and antagonist of the polypeptide. Pharmaceutical 
composition comprising the polypeptide or the antagonist. 



